WHERE DO WE GO FROM HERE?



The era spanning the nineteen-seventies through the
nineteen-eighties might at first suggest revolutionary
and dynamic breakthroughs in the field of language
assessment. Monumental capacities of computer-
enriched statistical approaches have seemingly yielded
a plethora of research on the one hand, with veiled
inferences of expanding existing knowledge, ad
infinitum. From another perspective, the metaphorical
geometric rate of amassing this data has perhaps
outdistanced our arithmetic ability to process this mass
and conceptualize adequate frameworks. However, a
corollary to this statement might still be equally argued
and applicable to language assessment, among other
areas, in that a few visionary prophets may have
outdistanced the parameters of current technology and
await the time when reification of constructs (so to
speak) bridges the gap between the theoretical and
empirical.

I am reminded somewhat of Zeno's paradox presented
by Troike in an earlier volume in this series (Seidner,
1982). In reaching our hypothetical goals, we appear to
cover half the distance, then half of the remaining
distance, ad infinitum. In essence, all victories, then,
become partial victories. But what do they mean? It
would seem that this question typifies every era of
human endeavor. Think of the immediate sensation
caused by Fechner, for example, who converted
Weber's Law into a psychological scale (on this, see
Boring, 1942, 1961). Nevertheless, the ensuing activity
and controversy in Germany had prompted William
James (1890) to write:

"And what good came of it at last?"
Quoth little Peterkin.
"Why, that I cannot tell," said he,
"But 'twas a famous victory!"

The radical impossibility of disassociating experimental
approaches from a current theoretical framework
compounds individual attempts to transcend the
figuratively stated arithmetic rate. Here, Claude Bernard
(1865) would also warn of an extreme attachment to
existing frameworks, where men "with excessive faith
in their theories" are ill-disposed toward making new
discoveries and shedding bias from their recorded
observations.

I would suggest here that the lack of consensus in
definitions of terms alternately impedes and accelerates
this arithmetic rate, compared to the current geometric
influx of information. A brilliant exchange between
John Oller and John Carroll brought forth the latter's
personal communication on the relationship of the "g"
factor to language proficiency, and the observation that
final conclusions would rest upon specific definitions of
intelligence and language proficiency (Oller, 1983).

Traditionally, theoretical frameworks have developed
cautiously for the most part under the scrutiny of
orthodoxy. However, the existence of varied definitions
might enhance developments in other related areas. If
Conant's (1947) concept of the "horror vacui" dictated,
in turn, that contradictory evidence in itself would not
supplant an established theoretical framework until a
better one arose, a newly formulated framework would
have to reflect some elements of established tenet. We
might find then that the definition of communicative
competence advanced by Canale and Swain (Seidner,
1982) illustrates a demarcation from perspectives of
Munby (1978), Hymes (in Pride and Holmes, 1972) or
Savignon (1972) in some respects, but forms a
consensus of agreement along other lines. The
operational definition of communicative competence
(arbitrarily labeled Z), viewed here fundamentally as the
verbal account of particular manipulations and
computations, becomes subject to what Carnap viewed
as testability, confirmability and verifiability (i.e., where
verifiability is characterized with an incompleteness in
attaining a definitive and final establishment of "truth").
If we then view a hypothetical assessment instrument
of communicative competence, associated with our
definition Z, it is the interpretation of the data
emanating from the particular procedure which is
validated (as indicated by Cronbach in Thorndike,
1971) in opposition to the viewing of validity, for
example, as a specific characteristic of the test. An
overall view of progress in the historical development in
language assessment, then, might be likened to the
spiral-like process attributed to Rousseau in the quest
of some absolute truth or perfection (De Balla, 1973).
Each advancement is subject to the flat plateau of the
spiral and awaits the processing of data and emergence
of an adequate conceptual framework. The time factor
in proceeding along the plateau and through the spiral
to the next level varies in its speed and is dependent
upon a myriad of factors which cannot be fully explored
here. In essence, we encounter the transformation
made by Boring (1936) of Occam's "razor" from "entia
non sunt multiplicanda praeter necessitatem" to "entia
sunt multiplicanda propter necessitatem."

If we agree with Day, McCollum, et al. in Erickson and
Omark (1981) that discrete point testing emerged as
the natural result of structural linguistic approaches
(influenced by behavioral psychology) to language
acquisition, the advance along the plateau through the
figurative spiral to the next one was relatively a
chronologically shorter one. Perhaps the same may be
true for adherents of what Carroll (1961) termed an
integrative philosophy. An example of this trend may
emerge again with some recent stirrings associated
with the reexamination of the so-called unitary factor
hypothesis by Farhady (1980, and Oller, 1983) as well
as by Briere (198?) among others. If a shortly
forthcoming breakthrough were, per se, to occur from
this direction in the design and implementation of new
language assessment instrumentation, would it be
mostly attributed to the aforementioned advances in
computerized statistical applications? Indeed, the
studies cited in the example appeared to rely upon
sophisticated statistically packaged approaches. First
appearances might suggest that it constitutes a
determining factor in accelerating spiral-like
advancement. Consider the comparison of approaching
factor analysis from the actual raw data or from a
correlation matrix. Statistical programming affords the
individual readily available approaches to address the
communalities in the diagonals of correlation matrices.
The inclusion of an oblique (or correlated) method of
rotation to "reconcile" highly loaded factors, or perhaps
the more popularly used orthogonal (or uncorrelated)
varimax method (developed by Kaiser, 1958), provides an
added sophistication in manageable statistical
calculations. If these and other approaches appear to
confirm the hastening of developments along the spiral,
I would suggest a resistance factor, or, perhaps, a
check and balance where research, utilizing the
normal curve, conforms to the confines of the
predetermined standard (see Seidner, 1982-b).
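
To make the orthogonal rotation mentioned above more concrete, the following is a minimal sketch of a varimax rotation for the two-factor case, written in plain Python. The loading matrix is hypothetical, the function names are illustrative assumptions, and the sketch uses the raw varimax criterion (without Kaiser's row normalization); it is an illustration only and not a substitute for a statistical package.

```python
import math

def varimax_criterion(loadings):
    """Raw varimax criterion: for each factor, the spread of the squared
    loadings (larger values mean a 'simpler' factor structure)."""
    p = len(loadings)
    total = 0.0
    for j in range(len(loadings[0])):
        squares = [row[j] ** 2 for row in loadings]
        total += sum(s ** 2 for s in squares) - (sum(squares) ** 2) / p
    return total

def varimax_two_factors(loadings):
    """Rotate a p x 2 loading matrix by the single angle that
    maximizes the raw varimax criterion."""
    p = len(loadings)
    u = [x * x - y * y for x, y in loadings]
    v = [2.0 * x * y for x, y in loadings]
    a, b = sum(u), sum(v)
    c = sum(ui * ui - vi * vi for ui, vi in zip(u, v))
    d = 2.0 * sum(ui * vi for ui, vi in zip(u, v))
    # Closed-form optimal angle for a pair of factors.
    phi = 0.25 * math.atan2(d - 2.0 * a * b / p, c - (a * a - b * b) / p)
    cos_phi, sin_phi = math.cos(phi), math.sin(phi)
    rotated = [(x * cos_phi + y * sin_phi, -x * sin_phi + y * cos_phi)
               for x, y in loadings]
    return rotated, phi

# Hypothetical unrotated loadings for four tests on two factors.
unrotated = [(0.70, 0.40), (0.65, 0.45), (0.60, -0.50), (0.55, -0.55)]
rotated, angle = varimax_two_factors(unrotated)

for before, after in zip(unrotated, rotated):
    communality = before[0] ** 2 + before[1] ** 2
    print(f"{before} -> ({after[0]:.3f}, {after[1]:.3f}), "
          f"communality {communality:.3f}")
```

Because the rotation is orthogonal, each test's communality (the sum of its squared loadings) is unchanged; only the distribution of the loadings across the two factors shifts.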

Returning to Zeno's paradox, the seemingly quickened
advance along the plateau which spirals upward might
be no more than illusory, in view of the attainment of
the half of the half of the half, which is perpetual. Given
another perspective, the computerized applications
might indeed prove a quagmire in instances of
statistical applications hindering, or perhaps shaping,
the direction of theoretical formulation, detracting from
necessary foundations of the fields of origin. Where
do we go from here?


Papers by John Oller and Jim Cummins (Part I) offer us
some prospects for new frontiers and breakthroughs.
Oller presents an intriguing blend of biological,
psychological and linguistic approaches to the question
of language-based intelligence. Cummins presents us
with a theoretical framework proposing the
conceptualization of language proficiency along several
continua. Muriel Saville-Troike's study (Part III)
substantially adds to our understanding of
communicative and academic competence in teaching
and testing situations. Benji Wald (also Part III) offers
important findings on the relationship between
spontaneous and test speech, utilizing a sample of
10-12-year-old Hispanic bilinguals. These and other
contributions are found in the three categories within
this volume: Foundations; Assessment Approaches;
Research and Policy. As with Volume I, it is with the
thought of bringing current issues to the forefront that
the Second Language Assessment Institute was
conducted in Chicago, Illinois. This volume was
conceived as the second in a series to include deliveries
by participants and invited papers from theoreticians
and practitioners in the field. Sponsors of the Institute,
which will be repeated annually, include the National
Clearinghouse for Bilingual Education; Educational
Testing Service; the Evaluation, Dissemination and
Assessment Center (Dallas, Texas); the Bilingual
Education Service Center (Illinois); the Illinois State
Bilingual Office; the Illinois Resource Center; and
National College of Education.

Behind the scenes in the production of any volume of 
this nature are the unsung heroes who commit 
countless hours of toil and dedication. It is with grateful 
thanks and appreciation that acknowledgement is
made to Jean Honeyman, Manager, Staff Support
Services, Illinois State Board of Education; to Delores
Johnson, Word Processing Supervisor; and to Becky
Copelin, Typesetter, Staff Support Services, who so
willingly gave so much time and expertise in the
typesetting of this project. Bill Becker, Manager, Illinois
State Board of Education, Internal Office Supporting
and Printing Section, and Bernie Neff, Assistant
Manager, Printing and Graphics, are invaluable in
providing much needed material support and creative
ideas. Special gratitude goes to Linda McElroy who
spent many hours poring over the manuscripts and is
deserving of the accolade, "an editor's editor." Cornelia
Powell and Steve Rothenberg added their much
appreciated talent in helping design the cover and other
graphics. I would also like to express my indebtedness
to Maria Medina Seidner, Manager, Bilingual Education
Section, for her assistance and support in the
preparation of this volume. Finally, my appreciation
goes to the contributors of this volume for their
excellent presentations and papers.

Editor
Stanley S. Seidner, Ph.D.
Associate Dean for Research and Development 
National College of Education 
CONCEPTUAL AND LINGUISTIC FOUNDATIONS OF LANGUAGE ASSESSMENT



Jim Cummins 
The Ontario Institute for Studies in Education



Many of the most contentious debates in the areas of 
psycholinguistics and educational psychology during 
the past 20 years have revolved around the issue of how 
"language proficiency" is related to academic 
achievement. Disagreement about appropriate ways of 
conceptualizing the nature of language proficiency
underlies controversies as diverse as the extent to
which "oral language" is related to the acquisition of
reading (e.g., Wells, 1981), the extent to which learning
disabilities are in reality language disabilities (e.g.,
Vellutino, 1979), and the extent to which poor school
achievement of low socioeconomic status (SES) and
minority group students is caused by differences in the
language use patterns of these students in comparison
to middle class students (e.g., Labov, 1970).

The question of what constitutes "language
proficiency" and the nature of its cross-lingual
dimensions is also at the core of many hotly debated
issues in the areas of bilingual education and second
language pedagogy and testing. Researchers have
suggested ways of making second language teaching
and testing more "communicative" (e.g., Canale & Swain,
1980; Oller, 1979) on the grounds that a communicative
approach better reflects the nature of language
proficiency than one which emphasizes the acquisition
of discrete language skills. Issues such as the effects of
bilingual education on achievement, the appropriate
age to begin teaching L2, and the consequences of
different patterns of bilingual language use in the home
on minority students' achievement are all intimately
related to the broader issue of how L1 proficiency is
related to the development of L2 proficiency. This issue,
in turn, clearly cannot be resolved without an adequate
conceptualization of the nature of "language
proficiency."

I shall first briefly review the current state-of-the-art in 
bilingual language proficiency assessment and then 
describe some of the detrimental consequences for
minority students' academic development which derive
from popular assumptions about the nature of "English
proficiency." Finally, I will outline a theoretical
framework for relating language proficiency to
academic development and suggest some applications
of the framework for both assessment and pedagogy.¹

Language Proficiency Assessment in Bilingual Programs 

A cursory examination of the many tests of language 
proficiency and dominance currently available for
assessing bilingual students (see, e.g., DeAvila &
Duncan, 1978; Dieterich, Freeman and Crandall, 1979)
reveals enormous variation in what they purport to
measure. Of the 46 tests examined by DeAvila and
Duncan (1978), only four included a measure of
phoneme production, 43 claimed to measure various
levels of lexical ability, 34 included items assessing oral
syntax comprehension and nine attempted to assess
pragmatic aspects of language.
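
The counts reported by DeAvila and Duncan can be restated as shares of the 46 tests, which makes the unevenness of coverage easier to see. The following is a minimal sketch rather than part of the original review; the dictionary labels and the function name `coverage_shares` are illustrative assumptions, while the counts are those cited above.

```python
# Counts cited from DeAvila and Duncan (1978); the labels are illustrative.
TOTAL_TESTS = 46
tests_covering = {
    "phoneme production": 4,
    "lexical ability": 43,
    "oral syntax comprehension": 34,
    "pragmatic aspects": 9,
}

def coverage_shares(counts, total):
    """Return the percentage of tests that include each component."""
    return {name: 100.0 * n / total for name, n in counts.items()}

shares = coverage_shares(tests_covering, TOTAL_TESTS)
for name, pct in shares.items():
    print(f"{name}: {pct:.1f}% of tests")
```

On these figures, more than nine in ten tests claim to measure lexical ability, while fewer than one in ten include a measure of phoneme production.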

This variation in language tests is not surprising in view
of the lack of consensus as to the nature of language
proficiency or "communicative competence." For
example, Hernandez-Chavez, Burt and Dulay (1978)
have outlined a model of language proficiency
comprising 64 separate components, each of which,
hypothetically at least, is independently measurable. By
contrast, Oller and Perkins (1980) have argued that:

"a single factor of global language proficiency 
seems to account for the lion's share of 
variance in a wide variety of educational tests 
including nonverbal and verbal IQ measures, 
achievement batteries, and even personality 
inventories and affective measures... the results
to date are... preponderantly in favor of the
assumption that language skill pervades every
area of the school curriculum even more
strongly than was ever thought by curriculum
writers or testers." (p. 1).

This global dimension is not regarded by Oller (in press)
as the only significant factor in language proficiency,
but the amount of additional variance accounted for by
other factors is relatively modest.

The considerable evidence that Oller and his colleagues
(e.g., Oller & Streiff, in press) have assembled to show
that academic and cognitive variables are strongly
related to at least some measures of all four general
language skills (listening, speaking, reading, and
writing) raises an important issue for the assessment of
entry and exit criteria in bilingual programs: to what
extent should measures of language proficiency be
related to measures of academic achievement? In other
words, to what extent does the construct of language
proficiency overlap with the constructs of "intelligence"
and academic achievement?

This theoretical question has rarely been asked; instead,
researchers have either asked only the empirical
question of how language proficiency is related to
achievement (often expressed in terms of the relation
between "oral language" and reading) or else ignored
the issue entirely, presumably because they do not
consider it relevant to language proficiency assessment
in bilingual education. However, the theoretical issue
cannot be avoided. The relationship of language
proficiency to academic achievement must be
considered in view of the fact that a central purpose in
assessing minority students' language dominance
patterns is to assign students to classes taught through
the language in which it is assumed they are most
capable of learning, and in which they will most readily
acquire academic skills. If measures of language
proficiency bear no relationship to students' acquisition
of academic skills, their relevance in the context of
entry and exit criteria is open to question. This issue
requires theoretical resolution, rather than empirical,
because, as will be discussed below, some language
measures correlate highly with achievement, while
others show a negligible relationship. Without a
theoretical framework within which language
proficiency can be related to the development of
academic skills, there is no basis for choosing between
alternative tests which are clearly measuring very
different things under the guise of "language
proficiency."

¹ The need for a theoretical framework explicitly designed to relate language proficiency to academic achievement was brought home to me at the

Essentially, what is at issue are the criteria to be used in 
determining the validity of language proficiency 
measures in the specific context of bilingual education. 
Whether we are talking about content, criterion-related, 
construct, face, or ecological validity, our procedures 
for determining validity are always based on a theory
regarding the nature of the phenomenon being
measured. In many cases, however, this theory has
remained implicit in language test development for
bilingual students, and where the theory has been made
explicit, the construct of language proficiency has
usually been regarded as independent of the constructs 
of intellectual and academic abilities. 

Thus, it is reported (see Oakland, 1977, p. 199) that on
the Basic Language Competence Battery there is little
or no increase in scores across the elementary grades
among native speakers. This is interpreted as evidence
for the construct validity of the battery in that it is
indeed measuring "language knowledge," rather than
intellectual abilities or educational achievement. In
arguing against "language deficit" theories, many
sociolinguists (e.g., Labov, 1970; Shuy, 1977) have
similarly asserted that language proficiency is
independent of cognitive and academic performance.
Shuy (1977, p. 5), for example, states that "rather
compelling evidence rejects every claim made by those
who attempt to show linguistic correlates of cognitive
deficit."



One apparent implication of the theoretical position
that "language proficiency" is independent of
intellectual abilities and academic achievement is that
language measures such as the integrative tests (e.g.,
oral cloze, dictation, elicited imitation) used in the
research of Oller and others (see Oller & Perkins, 1980;
Oller & Streiff, in press) would have to be rejected as
invalid to assess the construct of language proficiency
because of their strong relationships to achievement
and IQ.

Many theorists would regard any form of contrived test
situation as inadequate to assess language proficiency,
arguing instead for procedures which assess children's
language in naturally occurring communicative
situations (e.g., Cazden, Bond, Epstein, Matz, &
Savignon, 1977; Dieterich et al., 1979). For example,
Dieterich et al. argue in relation to an elicited imitation
task that "it mirrors no real speech situation and is thus
of questionable validity in assessing proficiency" (1979,
p. 541).

Although the requirement that proficiency measures 
reflect "naturally occurring speech situations" is a basic
principle of validity for many theorists, few pursue the
issue to inquire whether or not the communicative
demands of natural face-to-face situations are identical
to the communicative demands of classroom situations.
In classrooms, students' opportunity to negotiate
meaning with the interlocutor (teacher) is considerably
reduced as a result of sharing him or her with about
25-30 other students, and there is considerable
emphasis on developing proficiency in processing
written text where the meaning is supported largely by
linguistic cues, rather than the richer "real-life" cues of
face-to-face communication.

These issues are being raised not to argue against the
assessment of "language proficiency" in naturally
occurring situations, but rather to show the need for a
theoretical framework which would allow the construct
of language proficiency to be conceptualized in relation
to the acquisition of academic skills in bilingual
programs. The urgency of this need can be seen from
the fact that the most commonly used tests of language
proficiency and dominance for minority students clearly
embody different theoretical assumptions in regard to
the relationship between language proficiency and
achievement. The Language Assessment Scales (LAS)
(DeAvila & Duncan, 1977), for example, are reported to
consistently show moderate correlations with
academic achievement, whereas the Bilingual Syntax
Measure (BSM) (Burt, Dulay, & Hernandez-Chavez,
1975) and the Basic Inventory of Natural Language
(BINL) (Herbert, 1975) tend to show much lower
correlations with achievement (see Rosansky, 1981 for
a review). All of these tests showed lower correlations
with achievement than teachers' ratings of students'
chances of academic achievement if instructed only in
English (Ulibarri, Spencer & Rivas, 1980). The teacher
variable accounted for 41 percent of the variance in
reading achievement scores and the BINL, BSM, and
LAS added only zero, one and four percent, respectively,
to the prediction of reading achievement.

Apart from the issue of their relationship to academic
achievement, the validity of these tests can be
questioned on several other grounds. For example,
Rosansky (1981) points out that data elicited by the
BSM-English were unrelated to data elicited from taped
naturalistic conversation of the same individuals. The
LAS Spanish language classification is reported to
considerably underestimate the Spanish proficiency of
native Spanish speakers as assessed by either teacher
ratings or detailed ethnolinguistic analysis of children's
speech in a range of settings (Mace-Matluck, 1980).

This brief survey of assessment issues in bilingual 
education suggests that a major reason for the
confused state of the art is that the developmental
relationships between language proficiency (in L1 and
L2) and academic performance have scarcely been
considered, let alone resolved. The confusion about the
assessment of "language proficiency" is reflected in the
varied criteria used to exit language minority students
from bilingual programs.



"English Proficiency" and Exit Criteria



Lack of English proficiency is commonly regarded by 
policy-makers and educators as the major cause of 
language minority students' academic failure in
English-only programs. Thus, it is assumed that
students require bilingual instruction only until they
have become proficient in English. Logically, after
students have become "proficient in English," any
difficulties they might encounter in an English-only
program cannot be attributed to lack of English
proficiency.

If we combine this apparent logic with the fact that
immigrant students generally appear to acquire a
reasonably high level of L2 fluency within about 1½-2
years of arrival in the host country (Cummins, 1980c;
Snow & Hoefnagel-Höhle, 1978), then one might
assume that two years of bilingual education should be
sufficient for students to make the transition to an
English-only program. This line of reasoning is
frequently invoked to justify exiting students out of
bilingual programs after a relatively short period. It is
assumed that because students can cope adequately
with the communicative demands of face-to-face
situations and appear quite fluent in English, their
English proficiency is sufficiently well developed to
cope with the communicative demands of the regular
English-only curriculum on an equal basis with native
English-speaking students.

There is considerable evidence to suggest that this
logic is false. Bilingual programs which have been
successful in developing a high level of English
academic skills in language minority students have
usually maintained instruction in L1 throughout
elementary school. Usually it is only in the latter grades
of elementary school that students approach grade
norms in English reading skills (see Cummins, 1981 for
a review). In a similar way, it has been shown
(Cummins, in press a) that it took immigrant students
who arrived in Canada after the age of six 5-7 years, on
the average, to approach grade norms in academically
related aspects of English proficiency. Thus, it clearly
takes considerably longer for language minority
students to develop age-appropriate academic skills in
English than it does to develop certain aspects of
age-appropriate English face-to-face communicative
skills. It follows that students exited on the basis of
teacher judgments or language tests which primarily
assess face-to-face communicative skills are likely to
experience considerable academic difficulty in an
English-only program, and many will manifest the
well-documented pattern of cumulative deficits.

The dangers of unanalyzed notions of what constitutes
"English proficiency" can be illustrated by an example
from a Canadian study in which the teacher referral
forms and psychological assessments of 428 language
minority students were analyzed (Cummins, 1980c).
This particular child (PR) was first referred in grade 1 by
the school principal who noted that

"PR is experiencing considerable difficulty with
grade 1 work. An intellectual assessment
would help her teacher to set realistic learning
expectations for her and might provide some
clues as to remedial assistance that might be
offered."

No mention was made of the child's
English-as-a-second-language (ESL) background; this only emerged
when the child was referred by the grade 2 teacher in
the following year. Thus, the psychologist does not
consider this as a possible factor in accounting for the
discrepancy between a Verbal IQ of 64 and a
Performance IQ of 108. The assessment report read as
follows:

"Although overall ability level appears to be
within the low average range, note the
significant difference between verbal and
nonverbal scores ... It would appear that PR's
development has not progressed at a normal
rate and consequently she is, and will continue
to experience, much difficulty in school.
Teacher's expectations (at this time) should be
set accordingly."

What is interesting in this example is that the child's
face-to-face communicative skills are presumably
sufficiently well developed that the psychologist (and
possibly the teacher) is not alerted to her ESL
background. This leads the psychologist to infer from
her low verbal IQ score that "her development has not
progressed at a normal rate" and to advise the teacher
to set low academic expectations for the child since she
"will continue to experience much difficulty in school."
There is ample evidence from many contexts (e.g.,
Mercer, 1973) of how the attribution of deficient
cognitive skills to language minority students can
become self-fulfilling.

In many of the referral forms and psychological
assessments analyzed in this study the following line of
reasoning was invoked:

Because language minority students are fluent
in English, their poor academic performance
and/or test scores cannot be attributed to lack
of proficiency in English. Therefore, these
students must either have deficient cognitive
abilities or be poorly motivated ('lazy').

In a similar way, when language minority students are
exited from bilingual programs on the basis of fluent
English communicative skills, it appears that their
subsequent academic difficulties cannot logically be
attributed to "lack of English proficiency." Thus,
educators are likely to attribute these difficulties to
factors within the student such as "low academic
ability" (IQ).

These misconceptions derive from the fact that the
relationships between "language proficiency" and
academic development have not been adequately
considered, either among native English-speaking or
language minority students. In the remainder of this
paper, a theoretical framework is developed for
conceptualizing these relationships.

A Theoretical Framework³

On the basis of the foregoing analysis of the confusions
which exist, both in current language proficiency
assessment techniques and in procedures for exiting
students from bilingual programs, three minimal
requirements for a theoretical framework of language
proficiency relevant to bilingual education in the United
States can be outlined: First, such a framework must
incorporate a developmental perspective such that
those aspects of language proficiency which are
mastered early by native speakers and L2 learners can
be distinguished from those that continue to vary
across individuals as development progresses; second,
the framework must be capable of allowing differences
between the linguistic demands of the school and those
of interpersonal contexts outside the school to be
described; third, the framework must be capable of
allowing the developmental relationships between L1
and L2 proficiency to be described.

Current theoretical frameworks of "communicative
competence" (e.g., Canale, 1981; Canale & Swain, 1980)
do not (and were not intended to) meet these
requirements. Canale (1981) distinguishes grammatical,
sociolinguistic, discourse and strategic competencies,
but states that their relationships with each other and
with world knowledge and academic achievement is an
empirical question yet to be addressed. Although this
framework is extremely useful for some purposes, its
applicability to bilingual education is limited by its
static nondevelopmental nature and by the fact that the
relationships between academic performance and the
components of communicative competence in L1 and
L2 are not considered. For example, both pronunciation
and lexical knowledge would be classified under
grammatical competence. Yet L1 pronunciation is
mastered very early by native speakers, whereas lexical
knowledge continues to develop throughout schooling
and is strongly related to academic performance.

The framework outlined below is an attempt to
conceptualize "language proficiency" in such a way that
the developmental interrelationships between
academic performance and language proficiency in
both L1 and L2 can be considered. It is proposed only in
relation to the development of academic skills in
bilingual education and is not necessarily appropriate or
applicable to other contexts or issues. Essentially, the
framework tries to integrate the earlier distinction
between basic interpersonal communicative skills
(BICS) and cognitive/academic language proficiency
(CALP) into a more general theoretical model. The
BICS-CALP distinction was intended to make the same
point that was made earlier in this paper, namely,
academic deficits are often created by teachers and
psychologists who fail to realize that it takes language
minority students considerably longer to attain
grade/age-appropriate levels in English academic skills
than it does in English face-to-face communicative
skills. However, as is pointed out in other papers in this
volume, such a dichotomy oversimplifies the
phenomena and risks misinterpretation. It is also
difficult to discuss the crucial developmental issues in
terms of the BICS-CALP dichotomy.

The framework presented in Figure 1 proposes that in
the context of bilingual education in the United States
"language proficiency" can be conceptualized along
two continuums. First is a continuum relating to the
range of contextual support available for expressing or
receiving meaning. The extremes of this continuum are
described in terms of "context-embedded" versus
"context-reduced" communication. They are
distinguished by the fact that in context-embedded
communication the participants can actively negotiate
meaning (e.g. by providing feedback that the message
has not been understood) and the language is
supported by a wide range of meaningful paralinguistic
and situational cues; context-reduced communication,
on the other hand, relies primarily (or at the extreme of
the continuum, exclusively) on linguistic cues to
meaning and may in some cases involve suspending
knowledge of the "real world" in order to interpret (or
manipulate) the logic of the communication
appropriately.

This theoretical framework should be viewed within a social context. The language proficiencies described develop as a result of various types of
communicative interactions in home and school (see e.g. Wells, 1981). The nature of these interactions is, in turn, determined by broader societal
factors (see Cummins, 1981c). In order to emphasize the social nature of "language proficiency" this term will be used interchangeably with
"communicative proficiency" in describing the framework.

Range of Contextual Support and
Degree of Cognitive Involvement in
Communicative Activities

In general, context-embedded communication derives
from interpersonal involvement in a shared reality,
which obviates the need for explicit linguistic
elaboration of the message. Context-reduced
communication, on the other hand, derives from the
fact that this shared reality cannot be assumed, and
thus, linguistic messages must be elaborated precisely
and explicitly so that the risk of misinterpretation is
minimized. It is important to emphasize that this is a
continuum and not a dichotomy. Thus, examples of
communicative behaviors going from left to right along
the continuum might be: engaging in a discussion,
writing a letter to a close friend, writing (or reading) an
academic article. Clearly, context-embedded
communication is more typical of the everyday world
outside the classroom, whereas many of the linguistic
demands of the classroom reflect communication
which is closer to the context-reduced end of the
continuum.






It is useful to distinguish between internal and external
contextual factors in relation to the horizontal
continuum. External context refers to aspects of
language activities or tasks which are more or less
objectively specifiable along the embedded-reduced
continuum. Thus, inherent text characteristics make
reading and writing less context-embedded than
face-to-face communication. However, the location of
any particular task on the continuum will also be greatly
influenced by internal contextual factors such as degree
of familiarity and acceptance of the task or activity. In
this way, the role of sociolinguistic factors can be
accommodated into the framework (see discussion in
Rivera, in press).

The vertical continuum is intended to address the
developmental aspects of communicative proficiency in
terms of the degree of active cognitive involvement in
the task or activity. Cognitive involvement can be
conceptualized in terms of the amount of information
that must be processed simultaneously or in close
succession by the individual in order to carry out the
activity.

How does this continuum incorporate a developmental
perspective? If we return to the four components of
communicative competence (grammatical,
sociolinguistic, discourse, and strategic) discussed by Canale
(1981), it is clear that within each one, some subskills
are mastered more rapidly than others. In other words,
some subskills (e.g. pronunciation and syntax within L1
grammatical competence) reach plateau levels at which
there are no longer significant differences in mastery
between individuals (at least in context-embedded
situations). Other subskills continue to develop
throughout the school years and beyond, depending
upon the individual's communicative needs in
particular cultural and institutional milieux.

Thus, the upper parts of the vertical continuum consist
of communicative tasks and activities in which the
linguistic tools have become largely automatized
(mastered), and thus require little active cognitive
involvement for appropriate performance. At the lower
end of the continuum are tasks and activities in which
the communicative tools have not become automatized
and thus require active cognitive involvement.
Persuading another individual that one's own point of
view, rather than hers/his, is correct or writing an essay
on a complex theme are examples of such activities. In
these situations, it is necessary to stretch one's
linguistic resources (i.e. grammatical, sociolinguistic,
discourse and strategic competencies) to the limit in
order to achieve one's communicative goals. Obviously,
cognitive involvement, in the sense of amount of
information processing, can be just as intense in
context-embedded as in context-reduced activities.

As mastery is developed, specific linguistic tasks and
skills travel from the bottom towards the top of the
vertical continuum. In other words, there tends to be a
high level of cognitive involvement in task or activity
performance until mastery has been achieved or,
alternatively, until a plateau level at less than mastery
levels has been reached (e.g. L2 pronunciation in many
adult immigrants, "fossilization" of certain grammatical
features among French immersion students, etc.). Thus,
learning the phonology and syntax of L1, for example,
requires considerable cognitive involvement for two-
and three-year-old children, and therefore these tasks
would be placed in quadrant B (context-embedded,
cognitively demanding). However, as mastery of these
skills develops, tasks involving them would move from
quadrant B to quadrant A since performance becomes
increasingly automatized and cognitively undemanding.
In a second language context, the same type of
developmental progression occurs.
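
As a rough illustration only, the movement of a task from quadrant B to quadrant A can be sketched by treating the two continuums as a pair of coordinates. The numeric 0-to-1 scales, the 0.5 cut-off, and the names `Task` and `quadrant` are assumptions made for this sketch and are not part of the framework. The labels C and D for the context-reduced quadrants are likewise assumed here, since only quadrants A and B are characterized in the text above.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """A communicative task placed on the two continuums (hypothetical 0-1 scales)."""
    name: str
    context_reduction: float  # 0.0 = fully context-embedded, 1.0 = fully context-reduced
    cognitive_demand: float   # 0.0 = automatized, 1.0 = highly demanding


def quadrant(task: Task, cutoff: float = 0.5) -> str:
    """Return a quadrant label for a task.

    A and B follow the text; C and D are assumed labels for the
    context-reduced side. The cutoff is arbitrary: both dimensions
    are continuums, not dichotomies.
    """
    embedded = task.context_reduction < cutoff
    demanding = task.cognitive_demand >= cutoff
    if embedded:
        return "B" if demanding else "A"
    return "D" if demanding else "C"


# L1 phonology for a two-year-old: context-embedded but still demanding.
toddler = Task("L1 phonology, age 2", context_reduction=0.1, cognitive_demand=0.9)
# The same skill after mastery: demand falls, so the task moves from B to A.
adult = Task("L1 phonology, adult", context_reduction=0.1, cognitive_demand=0.1)

print(quadrant(toddler), quadrant(adult))
```

The same task is thus located differently for different individuals, which is why placement depends on the language user as well as on the task itself.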



The third requirement for a theoretical framework
applicable to bilingual education is that it permit the
developmental interrelationships between L1 and L2
proficiency to be conceptualized. There is considerable
evidence that L1 and L2 proficiency are interdependent,
i.e. manifestations of a common underlying proficiency
(see Cummins, 1981). The evidence reviewed in support
of the interdependence hypothesis primarily involved
academic or "context-reduced" language proficiency
because the hypothesis was developed explicitly in
relation to the development of bilingual academic skills.
However, any language task which is cognitively
demanding for a group of individuals is likely to show a
moderate degree of interdependence across languages.
Also, other factors (e.g. personality, learning style, etc.)
in addition to general cognitive skills are likely to
contribute to the relationship between L1 and L2 and
thus some cognitively undemanding aspects of
proficiency (e.g. fluency) may also be related across
languages.

As far as context-reduced language proficiency is
concerned, the transferability across languages of many
of the proficiencies involved in reading (e.g. inferring
and predicting meaning based on sampling from the
text) and writing (e.g. planning large chunks of
discourse) is obvious. However, even where the
task-demands are language-specific (e.g. decoding or
spelling), a strong relationship may be obtained
between skills in L1 and L2 as a result of a more
generalized proficiency (and motivation) to handle
cognitively demanding context-reduced language
tasks. Similarly, on the context-embedded side, many
sociolinguistic rules of face-to-face communication are
language-specific, but L1 and L2 sociolinguistic skills
may be related as a result of a possible generalized
sensitivity to sociolinguistic rules of discourse.

In conclusion, the theoretical framework appears to
permit the complexity of L1-L2 relationships to be
conceptualized at the same time as it provides a more
adequate rationale for the essentially simple point that
academic skills in L1 and L2 are interdependent. The
framework also provides the basis for a task-analysis of
measures of "language proficiency" which would allow
the relationships between language measures and
academic performance to be predicted for any
particular group of individuals. In general, the more
context-reduced and cognitively demanding the
language task, the more it will be related to
achievement. However, although there are intrinsic
characteristics of some language tasks which make
them more cognitively demanding and
context-reduced, these task characteristics must be considered
in conjunction with the characteristics of the particular
language users (e.g. L1 and/or L2 proficiency, learning
style, etc.). For example, skills that have become
automatized for native speakers of a language may very
well be highly cognitively demanding for learners of
that language as an L2. Thus, we would expect different
relationships between achievement and certain
language tasks in an L1 as compared to an L2 context.

Assessment of Entry and Exit Criteria Revisited

The theoretical framework can readily be applied to the
issue of the assessment of entry and exit criteria. The
problem highlighted earlier was that if language
minority students manifest proficiencies in some
context-embedded aspects of English (quadrant A),
they are often regarded as having sufficient "English
proficiency" both to follow a regular English curriculum
and to take psychological and educational tests in
English. What is not realized by many educators is that
because of language minority students' ESL
background, the regular English curriculum and
psychological assessment procedures are considerably
more context-reduced and cognitively demanding than
they are for English-background students. In other
words, students' English proficiency may not be
sufficiently developed to cope with communicative
demands which are very different from those of
face-to-face situations.

What assessment procedures should be used for entry
and exit in bilingual programs? Given that the purpose
of language proficiency assessment in bilingual
education is placement of students in classes taught
through the language which, it is assumed, will best
promote the development of academic skills, it is
necessary that the procedures assess proficiencies
related to the communicative demands of schooling.
However, in order to be valid, the procedures should
also reflect children's previous experience with
language. Because the child's language experiences
prior to school have been largely in context-embedded
situations, the assessment procedures for entry
purposes should involve cognitively demanding
context-embedded measures which are fair to the
variety of L1 (and L2) spoken by the child. However, for
exit purposes, it is recommended that cognitively
demanding context-reduced measures be used because
these more accurately reflect the communicative
demands of an all-English classroom. If children are
unable to handle the context-reduced demands of an
English test, there is little reason to believe that they
have developed sufficient "English proficiency" to
compete on an equal basis with native English-speaking
children in a regular English classroom.

These suggestions derive from a theoretical analysis of
the relationships between language proficiency and
academic performance and clearly require empirical
confirmation. However, without a theoretical framework
for conceptualizing these relationships, legitimate
empirical questions cannot even be asked. An example
of a commonly posed empirical question which is
essentially meaningless when asked in a theoretical
vacuum is the issue of the relationship between "oral
language proficiency" and reading. Within the context
of the present framework, "oral language proficiency"
could equally refer to cognitively undemanding
context-embedded skills as to cognitively demanding
context-reduced skills. As one would expect on the
basis of the present analysis, there is little relationship
between these two aspects of "oral language
proficiency"; also, reading skills are strongly related to
the latter, but unrelated to the former (see e.g.
Cummins, 1981).

Two other applications of the present framework can be
briefly noted. The first relates to language pedagogy. A
major aim of schooling is to develop students' ability to
manipulate and interpret cognitively demanding
context-reduced text. However, there is considerable
agreement among theorists (e.g. Smith, 1978) that the
more initial reading and writing instruction can be
embedded in a meaningful communicative context (i.e.
related to the child's previous experience), the more
successful it is likely to be. The same principle holds for
L2 instruction. The more context-embedded the initial
L2 input, the more comprehensible it is likely to be, and
paradoxically, the more successful in ultimately
developing L2 skills in context-reduced situations. A
major reason why language minority students have
often failed to develop high levels of L2 academic skills
is that their initial instruction has emphasized
context-reduced communication insofar as instruction
has been through English and unrelated to their prior
out-of-school experience.

In summary, a major pedagogical principle for both L1
and L2 teaching is that language skills in
context-reduced situations can be most successfully
developed on the basis of initial instruction which
maximizes the degree of context-embeddedness, i.e.
the range of cues to meaning. For language minority
students, this principle implies a language experience,
rather than phonics, approach to initial reading
instruction (whether L1 or L2).

A third application is related to the influence of home 
preschool experiences on children's acquisition of 
literacy skills in school. Wells (1981), in a ten-year 
longitudinal study, has identified two broad types of 
communicative activities in the home which strongly 
predict the acquisition of reading skills in school. One is 
the extent to which there is "negotiation of meaning" 
(i.e. quality of communication) between adults and 
children; the other is the extent to which 
literacy-related activities are promoted in the home 
(e.g., reading to children). There is no clear-cut 
relationship between SES and the former, but a strong 
relationship between SES and the latter. 



5 It should be pointed out that the framework in no way implies that language pedagogy should be context-reduced. There is considerable
evidence from both first and second language pedagogy (e.g. Smith, 1978; Swain, 1978) to support the principle that context-reduced language
proficiency can be most successfully developed on the basis of initial instruction which maximizes the degree of context-embedment. In other
words, the more instruction is in tune with the experience and skills the child brings to school (i.e. the more meaningful it is) the more learning
will occur. This is one of the reasons that bilingual education is, in general, more successful than English-only programs with language minority
students.



These results have two clear implications in terms of
the present framework. First, the strong relationship
observed between both literacy activities and
negotiation of meaning in the home and the later
acquisition of reading in school supports the principle
proposed above that context-reduced communicative
proficiency can be most successfully developed on the
basis of prior context-embedded communication, or to
put it another way, the more opportunity the child has
to process comprehensible linguistic input (Krashen,
1981) and to negotiate meaning, the greater the range
of input which will become comprehensible.

The second implication of Wells' findings is that many
low SES students experience greater initial difficulties
in school literacy acquisition because the school makes
little or no pedagogic adjustment for the fact that
these students may have had relatively few
literacy-related experiences prior to school. Children
who have had extensive prior context-embedded
literacy-related experiences would seem to be in a
better position to handle a more phonically oriented (i.e.
less meaningful) approach than children who lack such
experiences. (This, however, is not to imply that such an
approach is desirable for these children.) It would thus
seem especially important to make low SES and/or ESL
students' initial exposure to literacy in school as
meaningful and context-embedded as possible, perhaps
through a language experience or similar approach.
Although most middle-class children may be able to
handle more context-reduced approaches to teaching
reading, it is possible, as Smith (1978) suggests, that
this may not be the case for children who develop
learning disabilities. In other words, it is possible that at
least some of these "disabilities" are pedagogically
induced.

The fourth application derives from a large number of
studies showing strong relationships between L1 and
L2 academic skills (e.g. Cummins, 1981). In terms of the
framework, we would say that context-reduced
cognitively demanding aspects of language proficiency
are interdependent across languages. Thus, an
immigrant child with well-developed skills in his/her L1
will likely develop academic skills in L2 to a similar
degree. Assessment of a child's L1 proficiency clearly
has important implications for placement.

In summary, the major reason for the confusion in
regard to assessment procedures for entry and exit
criteria in bilingual education is that neither the
construct of language proficiency itself, nor its
relationship to the development of cognitive and
academic skills has been adequately conceptualized.
The extreme positions (1) that language proficiency is
essentially independent of cognitive and academic
skills, implied by some sociolinguists, and (2) that
language proficiency is largely indistinguishable from
cognitive and academic skills, suggested by much of
the psychometric research reviewed by Oller and his
colleagues, arbitrarily identify particular aspects of the
construct of language proficiency with the totality of
the construct. In the present paper, it has been argued
that language proficiency cannot be conceptualized as
one static entity or as 64 static entities. It is constantly
developing along different dimensions (e.g.
grammatical, sociolinguistic, discourse and strategic
dimensions) and being specialized for different contexts
of use among monolingual English-speaking, as well as
language minority children. In academic contexts,
certain aspects of language proficiency develop in
specialized ways to become the major tool for meeting
the cognitive and communicative demands of
schooling.

A major implication of the present framework is that
recognition of the very different communicative
proficiencies required of children in school encounters
as compared to the one-to-one, face-to-face interaction
typical of out-of-school contexts is a first step towards
the development of theoretically and empirically viable
entry and exit procedures.

ARE WE TESTING FOR INTELLIGENCE OR LANGUAGE?



John W. Oller, Jr.
University of New Mexico 



In the first part of this decade, at least three books will
appear which deal principally with the topic of
language testing issues.1 During 1981, the British
Council sponsored Issues in Language Testing edited
by Charles Alderson and Arthur Hughes, and the U.S.
Department of Education and the Illinois State Board of
Education sponsored Issues of Language Assessment:
Foundations and Research edited by Stanley S. Seidner.
In 1982 yet another volume, Issues in Language
Testing Research, is slated to appear under the
Newbury House imprint. Among the unresolved
questions in all three volumes is the one posed here:
Are we testing for intelligence or language?

What is language? What is intelligence?

On the definition of language proficiency, there seems 
to have been some progress toward acceptable
theoretical models — see Cummins (this volume, and 
also in press); Bachman and Palmer (in press); Canale 
(in press) and their references. There is some agreement 
now that both holistic and analytic models of language 
proficiency are useful. 

However, on the differentiation of language proficiency
from intelligence, there seems to be less agreement (see
Carroll, in press). There are at least two aspects to the
problem. On the one hand, there is the theoretical issue
of saying what either of the constructs at stake actually
is, and on the other hand, there is the question of what
is measured by tests intended to quantify one or the
other construct. The theoretical and the measurement
aspects of the problem are related, but are not identical.
It is conceivable that intelligence could be distinct from
language proficiency deep within the human psyche,
and yet that tests aimed at intelligence might still
largely measure acquired language proficiency.
Another possibility, which seems intriguing, is that
intelligence itself may be dependent on a kind of
"grammatical system," a logic of experience. This
system might be distinct from any particular language,
and yet be dependent on the development of
proficiency in some particular language in order to
mature normally. Other possibilities have also been
suggested and superior options no doubt remain to be
worked out. As Carroll (in press) suggests, there is a
good deal of relevant research, but it does not yet justify
a definitive demarcation of intelligence and language
proficiency.

Therefore, setting aside the measurement aspect
momentarily, it may be useful to consider the
theoretical question. To begin with, three speculative
premises are put in the form of "what-if" postulates.
With these premises in mind, some evidence is
considered on the nature of intelligence and its relation
to language processes.

Three What-If's

(i) What if cognition itself were dependent on a kind of
language system? Call this system Basic and say it is a
language for representing and interpreting the events of
experience.2 Suppose we assume for the sake of
argument that Basic is preprogramed and matures on a
biological schedule along the lines of Chomskian
"innate ideas." Basic might be construed as a grammar
specifying the structure of texts of experience.

To illustrate how such a grammatical system might
work, consider the following example. Imagine that you
are in a certain American embassy in the Middle East.
You work there. You are aware of the crowds that are
milling around outside. The noise and tension is
building. Suddenly, a window high above your desk
shatters into a spray of fragments. A brick hurtles
toward your head. You duck. The brick clatters to the
floor...

It may be argued that various textual structures are
taken into account in order for you to react intelligently
by ducking your head. The immediate proposition
which presents itself is the brick crashing through the
window and proceeding on a trajectory that will pass
through the space now occupied by your head. You are
suddenly aware that the brick (a kind of grammatical
subject in propositional logic) has just broken the
window (a direct object) and is proceeding (a transitive
predicate associated with the subject) toward your own
head (another potential direct object). You anticipate
the consequence that the brick is about to hit you in the
head (an undesirable proposition) which you negate by
removing your head from the trajectory of the brick.

Meetings of Colorado TESOL in September, 1980; the New York ESOL and Bilingual Education Associations in October, 1980; the Ontario
Institute for Studies in Education in February, 1981; New Mexico State University in Las Cruces during the month of April, 1981; and the
American University in Washington, D.C. as the fourth in the Hugo J. Mueller lecture series, in April, 1981. Thanks are expressed especially to
Mark Clarke and John Haskell for their comments and encouragement which spurred the completion of the Language Learning (December 1981)
version under the title "Language as Intelligence?" Other readers or listeners whose comments have helped to shape the final product have
included Jack S. Damico, G. Richard Tucker, Mary Ann Hood, Frank Smith, and Helga Delisle.

2 Of course, this is not to say that the language of experience is basic in the sense of being the lowest order system. Clearly it depends on the
existence of many lower order systems which feed information into it. Basic presupposes the existence of a body and all of its sensorimotor
systems. It also anticipates a great deal about the nature of event structures external to the organism. It both specifies and makes [...]
knowledge base that is progressively materializing over the temporal course of experience. In this sense, ordinary experience may be construed as
a text [...] or better, a series of texts, which come to the organism over the course of time. The interpretation of this series of texts is a sort
of reading process which, however, is not perfectly linear in nature.

The point of the example is to suggest that perhaps the
meaningfulness of the actions and interpretations of
mundane experience may be contingent upon a
grammatical system which utilizes textually complex
relations. Among them would be subject-predicate
relations, negation, and conjunction, as well as textual
operations of inference concerning antecedent
propositions (the fact that someone threw the brick)
and consequent propositions (the brick is about to hit
you in the head, but if you duck, it will miss).

All of this no doubt could be made much more
elaborate and explicit, but the point is simply to try to
show the surprising plausibility of the idea that
cognition may be based on a deep language system:
not any particular language (e.g. Chinese, Russian,
English, Navajo, or what have you), but a universal
logic of experience which matures on a biological
timetable given sufficient nutrition and normal
exposure.

(ii) What if grammars of particular languages (English,
Spanish, Chinese, etc.) were mapped into the texts of
Basic (that is, the language of experience) via a kind of
translation process? Basic would underlie the
intermediate particular language system and would link
that language to the event structures (i.e. what we have
termed the "texts") of experience. In its turn, the
intermediate particular language system would underlie
the manifest forms appearing ultimately as utterances
or texts in that natural language system (English, or
whatever).3

(iii) What if "non-verbal intelligence" were actually 
some aspect of the capacity to negotiate texts in Basic,
while "verbal intelligence" were more closely linked to
the intermediate grammar of a particular language and
its manifest forms (speech, writing, or sign)? Over the
course of development, Basic could be expected to 
develop along with the acquisition of proficiency in the 
particular language system. 

If these three uncommon assumptions were allowed,
just for the sake of argument, an act of intelligence
could be defined as a representational problem of fitting
texts either to the event-structures of experience or to
other texts. For instance, Basic texts are fitted to the
facts of experience, and texts of a particular language
are similarly fitted (or in Piaget's term, "equilibrated") to
Basic. In the case of verbal acts, we might say that the
facts or events manifest in experience are represented
in Basic and then may be translated via a particular
language into manifest forms such as utterances or
written surrogates.

It follows that a criterion of "goodness of fit" is logically
bound up in this definition of intelligence. The
negotiation of event-structures is "intelligent" just to
the extent that the implicit and explicit propositional
values fit and are appropriate to the facts of experience.
That is, action can be judged to be intelligent only, it
would seem, in relation to some notion of truth, validity,
correctness, functionality, or value.



How Do You Eat an Elephant?

The question that we started out with now becomes
more problematic than before. To illustrate how, it may
be useful to consider the riddle posed in the
immediately preceding subhead: How do you eat an
elephant? The answer is so obvious that it may easily
be overlooked: You eat an elephant one bite at a time.

The riddle is instructive in at least two ways. For one
thing, it reveals something significant about the
temporal nature of experience and language.
Experience comes to us in small bites, moment by
moment, day by day. Language too is highly
sequentialized and broken up into units consisting of
sequences of subunits and so forth. So, we eat our
respective elephants just one bite at a time.


However, the riddle can also be instructive in another
way. It illustrates something about the process of fitting
text to the facts of experience. In it we gain some idea of
what is entailed by the notion "goodness of fit."

For instance, why did the riddler choose an elephant?
Why not a mountain?

The riddle would have lost something. For one thing, 
mountains are not normally edible. Besides, the riddler 
knew that humans are generally carnivorous, that they
kill for food. He understood that his audience knew
English and knew about elephants being edible. He also
was aware of the fact that most English speakers don't
usually eat elephants. Thus, they might well be decoyed
away from the correct answer by the problem of how to 
prepare elephant meat — something they would not 
likely have ever done. 



Here and throughout this paper the term "text" is used in the sense of an "actual" system rather than a "virtual" system (see de Beaugrande,
1980). A virtual system merely has potential for realization in experience, while a text actually occurs in the stream of real experience. De
Beaugrande and others (e.g., see John Dewey, 1916) argue that actual systems are interpretable and meaningful in ways that virtual systems
cannot be. Note that this is radically at odds with the earlier claims of some linguists who argued that a sentence, for instance, could not have any
meaning in a context that it did not have in isolation. To the contrary, the argument for viewing texts as actual rather than virtual systems
suggests that temporally progressive experience invests elements of text with meaning that they could not have apart from that particular and
unique real context.



4 The term "nonverbal intelligence" as it is used here is intended to refer to what exists, without assuming that it can be



Hmmmmm. So, elephants are not commonly used for
food by the audience. Neither are grasshoppers, so why
not a grasshopper?

It just wouldn't make as good a riddle. For one thing,
grasshoppers are not large enough to be a very
challenging quarry, neither are they a great challenge to
eat, at least not because of their size.

Well, then, why not ask: How do you eat a truckload of
grasshoppers? One grasshopper at a time. Gulp!

It's shocking in a way, but it misses the riddler's point,
which was something to the effect that a gigantic
problem can be dealt with in small phases.

Isn't there also some shock value in the idea of eating
an animal that is often used as a beast of burden? The
riddler could have achieved a similar effect by choosing
a horse, but horses seem far less formidable, and to
most English speakers, eating one seems a little too
close to eating a dog or cat. The idea borders on
cannibalism.

Doesn't the choice of an elephant also bring with it a
whole mystique that would otherwise be lost? Doesn't
it call to mind scenes of African lions, dark jungles,
Bengal tigers, and Indian gurus?

And what about the existing lore concerning
elephants? Consider the expression, "She has a memory
like an elephant." Or what about the well-worn parable
of the blind men and the elephant? And don't forget the
elephant riddles of the last two decades (e.g., "How do
you hide an elephant in a cherry tree?" Or "How do you
get four elephants in a Volkswagen?").

So much to provide just a glimmering of the
significance of the notion "goodness of fit."

Is it possible that intelligence is really a characteristic of
symbol systems capable of generating texts which fit
the facts of experience? Jean Piaget observed some
decades ago that "every definition of intelligence comes
sooner or later to lean on biology or logic" (1947, p. 3).
The approach here leans heavily on biology and also on
a logic of natural experience, the sort that Dewey
(1916) had in mind in his Essays in Experimental Logic.
It is also possible to make a psychometric case for the
definition of intelligence considered here, but this is
attempted in another work (Oller and Streiff, in press).
With all of the foregoing in mind, some of the genetic,
cytological, and neurological evidence bearing on the
proposed definition will first be considered. Then,
having sketched in some of the evidence supporting the
proposed theoretical framework, the measurement
problem will be dealt with briefly through an analysis of
a few sample items drawn from "non-verbal" and
"verbal" IQ tests.



Biological Evidence

According to a biological approach, perhaps the
primary characteristic of intelligence is adaptability.
Piaget (who incidentally began his career as a biologist)
describes the capacity of an organism to "assimilate" its
environment to itself and to "accommodate" itself to the
environment wherever change is needed and the
environment itself is unyielding. He refers to the whole
process as "equilibration."

Nowadays, biologists are turning increasingly to the
texts which make this sort of intelligent action or
equilibration possible. In 1977, the eminent biologist
Brian Clark commented that the deciphering of the
"genetic code" in the mid 1960s was "one of the most
significant advances in this century in the biological
sciences" (p. iii). It had been known for several decades
that proteins played a crucial role in the development of
living things, but it was not until the 1960s that the
structure of proteins was systematically linked to the
code written in the DNA molecule of cell nuclei. Since
then, biologists have come to refer to the
macro-molecules of DNA and also to the proteins as
texts.

Is it possible, therefore, that there might be a language
basis for the existence and structure of organisms?
Could the order we see in living organisms be
dependent on a language system?

Martynas Ycas, another eminent biologist, wrote in
1969:

It is now both useful and commonplace to
regard the proteins as a topologically linear
'text' written in a 20-symbol alphabet, the
amino acids. This text, in turn, is specified by
(or encoded in) the nucleic acids (DNA and
RNA), another text whose alphabet has four
symbols, the bases. Some of these nucleic
acids are the self-replicating genetic material,
and thus the information specifying proteins is
transferred from generation to generation. The
coding problem is to determine how one text
specifies the other (p. 1).

Shortly after the cracking of the code in 1966, another
noted biologist, Carl Woese, wrote:

The cell has a certain vocabulary of words . . .
with which to construct messages in nucleic
acid language. These messages are then
translated into amino acid language by means
of a dictionary or code book . . . The cell seems
to employ a tape-reading process in the
synthesis of protein (his italics; 1967, p. 4).
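
The "dictionary or code book" image in the two passages just quoted can be made concrete with a few lines of Python. What follows is only an illustrative sketch and not anything drawn from Ycas or Woese: the table is a small excerpt of the standard genetic code (a handful of the 64 three-base words), and the names CODE_BOOK and translate are hypothetical labels chosen for this example.

```python
# Excerpt of the standard genetic code, written over the RNA alphabet
# (A, C, G, U). Only a handful of the 64 codons are listed here.
CODE_BOOK = {
    "AUG": "Met",  # also the usual start signal
    "UUU": "Phe", "UUC": "Phe",
    "GGU": "Gly", "GGC": "Gly",
    "AAA": "Lys",
    "UGG": "Trp",
    "UAA": None, "UAG": None, "UGA": None,  # stop codons: end of message
}


def translate(message):
    """Read a nucleic acid 'text' three symbols at a time, like a tape,
    and return the amino acid 'text' that it specifies."""
    protein = []
    for start in range(0, len(message) - 2, 3):
        codon = message[start:start + 3]
        if codon not in CODE_BOOK:
            raise KeyError(f"codon {codon!r} is not in this excerpt")
        amino_acid = CODE_BOOK[codon]
        if amino_acid is None:  # stop codon reached
            break
        protein.append(amino_acid)
    return protein


print(translate("AUGUUUGGCAAAUGGUAA"))
```

The point of the sketch is simply that one linear text (over a four-symbol alphabet) is mapped onto another (over a twenty-symbol alphabet) by table lookup; it says nothing about how the cell physically carries out the reading.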



Additional biological evidence for some sort of
language system comes from observations of
communications that occur within cells and between
them and the external world. How are such
communications possible? How, for instance, does a
cell know its functions? How is its equilibrated
knowledge coded, stored, retrieved, and utilized?

For example, consider the flight of an amoeba from light
or heat. When the source of the disturbance is
introduced into the medium, the amoeba does not tend
to move toward it or even parallel to it, but away from it.
In some manner, it seems, the amoeba takes account of
a state of affairs external to itself, adjusts its internal
states, and thus alters its relation to the external state
so as to restore a condition more favorable to its
continued existence. To describe what happens inside
the organism in purely mechanical (physico-chemical)
terms does not really do away with the most difficult
aspect of the problem. Even if a mechanical basis is
found for the movements of the organism, there is still
the problem of how the internal events are set in
systematic correspondence with external states of
affairs so as to enhance the organism's likelihood of
survival.

Of all people, the author of the precursor of the modern
intelligence tests, Alfred Binet, made an interesting
foray into this area as early as 1888. While looking for
certain of his writings on intelligence testing, I
stumbled onto a book entitled The Psychic Life of
Micro-Organisms, published in 1888.

In that remarkable little book, Binet claimed, among
other things, that single-celled organisms lead a
surprisingly complex mental life. For instance, he
argued that they possess mental capabilities that
enable them to display emotions such as fear, sexual
arousal, memory, and instinct.

To illustrate the capacity for fear, he wrote:

There is not a single ciliate infusory that cannot
be frightened . . . If a drop of acetic acid be
introduced beneath the glass slide, in a
preparation containing quantities of infusoria,
the latter will at once be seen to flee from all
directions like a flock of frightened sheep (p. vi).

Perhaps his most interesting argument is found in his
observations about the sexual behavior of
micro-organisms. He described the process of courtship,
pursuit, and union in paramecia as having all the flavor of
similar rituals in higher organisms, including the game
of "hard-to-get." The chase, as he described it, ends in a
"blending" of the two organisms "at an important part of
their persons in a very intimate fashion" (p. 69).
Although Binet could not have known it in 1888, the
organisms were actually exchanging their genetic texts.
He did see this process, however, as a microcosm of
what higher organisms perform in their own sphere.



He quoted Balbiani (a famed biologist of that era) who
said:

I believe that the spermatozoids do not move
about blindly, but . . . act in obedience to a kind
of inner impulsion, to a sort of volition which
directs them to a definite object (p. 78).

If this is the case, are we not moved to ask on what
basis the sperm cells know what they are after? How is
the plan coded inside the organism? How does the
organism take account of things external to itself so
that it knows when to start executing its plan and when
the plan has been culminated? Could the plan of the
sperm cells be simpler than the sort of logic underlying
propositions that relate agents to objects via certain
intermediate actions? (On human goals and plans, see
Schank and Abelson, 1977.)

Binet was not unaware of the considerable problem
which his observations inevitably brought to light.
What sort of internal mechanism would explain the
complex communications that he was led to infer? He
put the difficulty like this:

What would be necessary to explain is how
and in consequence of what mechanism of
structure, one form of molecular movement,
corresponding to a given excitation, is followed
by a certain other form of molecular movement
corresponding to an act likewise determined (p.
65).



Binet correctly anticipated the prevailing inclination of
present day theorists toward a mechanistic explanation.
However, he seemed to be arguing that such an
explanation would not succeed in eliminating the need
for a rationalistic account. Why else did he title his book
The Psychic Life of Micro-Organisms? (See note 5.)
Binet observed that the nucleus is "an essential factor in
the cell's vitality" (p. 100), a seat of government from
which the protoplasm receives its "delegation of
physiological powers" (p. 101). His remarks raise images
of politics as much as of biology, and they suggest that
even the simplest one-celled organisms are like
miniature states requiring communication, transportation,
and defense.

Certainly, some significant governmental problems
exist between the cells of multicellular organisms. How
do the various types of cells communicate with each
other? For instance, when you cut your finger, how are
the orders issued to reconstruct the damaged tissue?
How do the cells cooperate in transporting the
necessary materials to the site to do the
reconstruction? And, in general, how are the differences in
function established and maintained across the
numerous cells of any complex organism?

In a popular article, "Pattern Formation in Biological
Development," Lewis Wolpert noted that "there is a
widespread feeling that cells must have complex
conversations with each other during development"
(1978, p. [illegible]). However, he hastened to point out
that for his own part, he believed the conversations to be
relatively short, and possibly even "boring." Nevertheless, he
admitted that a large number of experiments indicate
that "spatial patterns arise as the result of cell to cell
interaction" (p. [illegible]).

5 Actually, mental would probably be a better modern translation of the French term psychique, thus changing the title to The Mental Life of
Micro-Organisms.

The question, as Wolpert noted, is what kind of
physico-chemical mechanism is employed in
intercellular communication. The fact that such
communication takes place is not questioned. One of
the things that must be communicated from cell to cell
during embryological development is information about
location in the developing organism. Apparently
biologists believe that this sort of information is crucial
to the differentiation of cells and the formation of the
various organs of the body.



The mechanism for intercellular communications,
however, remains a mystery. Is it conceivable that any
basis could exist which will not entail textual
complexities of the sort which are inferred for
intracellular communication? Does the genetic basis of
life suggest anything other than some sort of language
for intercell communication? And what about the case
of biological organization in higher organisms?



More specifically, what about man?



At least since the germinal work of Karl Lashley (1951),
it has been known that human cerebral functions
exhibit certain text-like properties. Lashley pointed out
that a movement as simple as "reaching and grasping"
is dependent on the sort of hierarchical organization
characteristic of the intricate structuring of sounds,
syllables, words, phrases, and clauses of discourse. In
his Programs of the Brain (1978), the Oxford biologist
J. Z. Young writes:

To understand the language of the brain it is necessary
to know how the nerve cells combine, like letters or the
phones of speech, to produce units that have meaning,
like words . . . If grammar is the system that regulates
the . . . use of language, we might say that the brain
operates a sort of metalanguage with a metagrammar,
which regulates . . . the conduct of life, including speech
(p. 46).

How is it that the temporal coordination of hierarchical
sequences is accomplished? Even the simplest sorts of
actions (e.g., tying shoe laces, buttoning a shirt,
opening a door, starting a car) all involve delicately
orchestrated sequences of events. Returning to the
"what-ifs" stated at the outset, it might be proposed
that the ordering of events is accomplished via a kind of
universal language system. At times there may be
interaction between this universal Basic system and the
grammar of a particular language, but there are good
reasons to suppose that this is not always necessary.
The question then is, how does the brain relate holistic
impressions with highly grammatically structured
sequences of events? Is it possible that the
differentiation of functions in the two hemispheres of
the brain may shed some light on it?
the hiam may shmj burnt) tight mi It pigve^ 

With respect to hemisphere specialization, Bogen, a
neurosurgeon and Professor of Medicine at the
University of Southern California, wrote:

the right hemisphere is specialized for
processing time-independent stimulus
configurations and the left hemisphere for
time-ordered stimulus sequences (1977, p.
[illegible]).

Of course, this does not mean that these functions are
completely separated. However, the left hemisphere is
better at organizing and interpreting hierarchically
arranged sequences of words, numbers, or other
categories, while the right is better at handling Gestalts,
images, or complex totalities. It seems that the left
hemisphere excels at taking things apart (analyzing),
while the right excels at handling global patterns
(synthesizing).

It has become popular to relate the different
specializations of the two hemispheres to the most
widely attested types of intelligence. "Verbal
intelligence" is associated with the left hemisphere,
while "non-verbal intelligence" is attributed largely to
the right hemisphere. Verbal processing is highly
temporalized. On the other hand, taking account of a
Gestalt, or a global pattern, seems to require a certain
simultaneity that must largely disregard the temporal
aspect of elements within the whole.

The picture is complicated by the fact that both
hemispheres seem capable of performing both types of
function to some extent. Also, it seems that the linking
of Gestalts to temporalized text-like structures is an
integral aspect of ordinary experience.

Although the notion that brain functions must occur in
particular sites of tissue became popular immediately
after Broca's discovery of the so-called "speech center"
in 1861, more recent research reveals more or less
global brain activity during performance of demanding
tasks.

Based on observations of blood flow to the various
regions of the cortex in both hemispheres, Lassen,
Ingvar, and Skinhoj (1978) conclude:

It appears that for the brain to 'understand' the
surrounding world, to perceive its meaning and
to take action in difficult tasks, the cerebral
cortex must be activated not only locally, but
also totally (p. 71).

As Pribram (1971) has observed, there are many brain
functions that seem to be distributed globally. He cites
the case of our ability to write with the non-dominant
hand. The question is, how can the information about
writing which is presumably stored in the left
hemisphere (which controls the right hand) be
transmitted to the right hemisphere and from there to
the left hand? Or, as Pribram asks, how is it that a
right hander will be able to write his name with his left
foot in bold strokes in the sand at the seashore? How
can specialized data be communicated in toto from
one brain region to another?

In an attempt to solve this intriguing puzzle and others
associated with it, Pribram proposes that the elusive
"engram" (the brain's unit of memory) may actually be a
wave form that has holographic properties. If Pribram's
idea is correct, the brain represents the world in such a
way that its coded information has a holistic
(analogical) correspondence to experience, rather than
a feature by feature (digital) correspondence. Among
other things, Pribram's "holographic memory" would
help to explain why it is that so much of the
tissue associated with a particular brain function may
be destroyed, and yet the function may still remain
intact.

But what then becomes of this dichotomy that
divides intelligence into the supposed verbal functions
of the left hemisphere and the nonverbal of the right?
The question seems to be whether the brain divides up
the work in the supposed manner between left and right
hemispheres, or whether this is not somewhat of an
oversimplification.

It seems that many of the most common uses of
language require both temporal (sequential analytic)
and simultaneous (Gestalt synthetic) operations. Aren't
these logically aspects of the same textual processes?
Imagine, for instance, what sorts of brain functions
might be involved in describing how to row a boat. Or,
consider what mental processes are involved in
describing the stars in the sky on a particularly dark and
clear night. Don't both of these uses of language require
analytical and synthetic mental functions? They both
seem to require handling of hierarchically
sequentialized elements of language as well as the
processing of holistic and fluid impressions of
experiences. In fact, isn't it the case that the verbal
descriptions must be pragmatically mapped onto the
holistic impressions?

To account for such simple brain processes as these, it
would seem that a theory is required that sees both
"verbal" and "nonverbal" aspects of intelligence as parts
of a more inclusive totality. It was apparently this sort of
perspective which led UCLA brain scientist Harry
Jerison to argue that "a nonverbal test of intelligence
may be a contradiction in terms" (1977). In the same
context he said, "language and language-related
performances are inevitably dominant in most human
performances" (p. 59).

Is it not possible that, in fact, experience itself may be
essentially textual in nature? That is, in order for it to be
meaningfully processed, could it perhaps be necessary
to perceive the complex linkages of events in their
apparent temporal progression through some sort of
propositionally complex language system? Is it possible
that the translation of nucleic acid language into
protein language might actually provide a useful
analogy for the way experience is interpreted through
the use of natural language systems? Could talking
about experience be a process like translating from one
language into another? In fact, is it possible that talking
about experience is more like translating between
language systems than like relating a language system
to something that is essentially nonpropositional in
nature?

So much for biological evidence on the theoretical
framework. To relate all of the foregoing to the problem
of intelligence testing, we now turn to an examination
of some typical "intelligence" items.

What Do IQ Items Require?



A logical analysis of the structures required by tasks in
so-called "intelligence" tests would seem to have
substantial bearing on the determination of what
intelligence tests measure. It also bears heavily on what
may be called "the language hypothesis," the idea
that nonverbal and verbal intelligence tests probably
measure acquired language skills more than anything
else (e.g., Oller and Perkins, 1978). One empirical
approach to the testing of this hypothesis would be to
go to the test takers themselves and get them to
introspect about how they are able to solve the
problems in the tests. Roth (1978) used this approach.
Another method is to examine the variance in tests by
factorial methods (Carroll, in press; Bachman and
Palmer, in press).



Still another method is to examine the items that are
included in certain so-called "intelligence" tests
(Gunnarsson, 1978). The purpose of this sort of analysis
would be to determine what mental operations must
accompany the solution of commonly used types of
items. The question is: What sorts of internal
representations are essential to the solution of typical
intelligence test items, and what kinds of operations
must be performed on those internal representations?
Possibly such an analysis will help to lead us beyond
the claim of Arthur Jensen (1969) that intelligence is
"what intelligence tests measure" (p. 8).

First we will examine some of the most commonly used
"intelligence" test items of the nonverbal type, and then
we will have a look at some "intelligence" items of the
verbal type.

Nonverbal IQ Items. Among exemplary nonverbal IQ
tests are Cattell's Culture Fair Intelligence Tests and
Raven's Progressive Matrices. Jensen (1980) refers to
these as nearly "pure" measures of "general
intelligence."

Figure 1 contains examples of the four item types that
appear in the Cattell tests. The third row in the figure,
labeled "Matrices," represents the sort of items
appearing in the Raven's tests as well. Thus, an
examination of these four item types should provide a fairly
representative idea of what typical nonverbal items
require. These samples come from a pamphlet prepared
by the publishers of the Cattell tests to inform users
about the test and its applications. Since the items
shown are fairly simple ones by comparison to those
that actually appear in the tests, whatever mental
processes are required by these should be equally
necessary to the more complex items that actually
appear in the tests.

Sample items from An Introduction to IPAT
(Cattell) Culture Fair Intelligence Testing

For each type of item, there is a separate set of
instructions. For instance, in order to solve items of the
"Series" type, it is necessary to select the alternative on
the right that best completes the sequential pattern that
is displayed on the left. The examinee must note,
among other things, that the rectangular dashes on the
left become longer as one progresses through the series
from left to right. It may also be noted that the increase
in length is constant from one dash to the next.

What sorts of mental operations are necessary in order
to solve items of the "Series" type? In addition to
understanding the instructions themselves, which may
present some difficulty to the lower range of examinees,
there are other textual operations that seem logically
indispensable to the solution of such items. Setting
aside the question of how the examinees distinguish
between the lead-in material (the item stem) and the
possible answers (the multiple-choice alternatives),
may we not infer that they must internally represent the
dashes on the left in some manner or other? Perhaps
the representations remain visual throughout the
solution of the problem, but can it not be argued even if
this is so that these representations must nonetheless
serve as propositional subjects with which certain
predicates must be associated?

For instance, necessary logical predicates of the second
dash include (it would seem) the fact that it is the
second one in the series (i.e., that it is positioned to the
right of the first), that it is longer than the first dash, that
it is oriented on the horizontal, and that it has the same
width as the first. And no doubt other facts can be
adduced. Each of these facts can be understood as a
proposition for which the subject-predicate relation is
an essential ingredient.

The question is whether these facts can be "known" in
any sense at all without some form of propositionally
complex logic in which to represent them. In addition,
there is the implicit proposition (or fact) that the first
and second dashes are objects of the same type, and
that they form a series of similar objects with the third
dash. This requires that the "series" be treated as a
logical subject which is distinct from its individual
elements. Further, there is the proposition that the
increase in length from dash one to two to three is
relatively constant. From this fact it may be inferred that
the next item in the logical series must have the same
properties as the preceding ones. In other words, it may
be inferred that the same predicates that have applied
to preceding subjects will apply to the next logical
subject in the series. (A necessary presupposition for all
of this would be the proposition that things will be
orderly; i.e., that there is a logically discoverable answer
to the test item.)

The item solver may rule out choice 2 because it is the
same approximate length as the second dash in the
series. He knows from the proposition that the dashes in
the series become progressively longer from left to right
that the correct answer must be longer than the third
dash in the series. Choice 3 may be ruled out because
the dash in that alternative is not oriented on the
horizontal. Here it seems that an implicit negation is
required (i.e., not horizontal). (Again, it is acknowledged
that the mental operations may in fact be performed
on visual images, but does this eliminate the necessity
of deep textual structures?) Choice 4 can be eliminated
because it does not satisfy the predicate of being longer
than dash three in the series, and choice 5 is too narrow.
It does not satisfy the predicate of being the same width
as the other dashes in the series.
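
The elimination just described can be restated as a small Python sketch in which each dash is a logical subject and each requirement is a named predicate. It is offered only as an illustration under stated assumptions: the numerical lengths and widths are invented (Figure 1 gives no measurements), and the names Dash, predicates_for, and failed_predicates are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dash:
    length: float      # invented units, for illustration only
    width: float
    horizontal: bool


# The three dashes of the item stem, growing by a constant amount.
series = [Dash(2, 1, True), Dash(3, 1, True), Dash(4, 1, True)]

# Five hypothetical alternatives that mirror the description in the text.
choices = {
    1: Dash(5, 1, True),
    2: Dash(3, 1, True),     # about as long as the second dash
    3: Dash(5, 1, False),    # not oriented on the horizontal
    4: Dash(4, 1, True),     # not longer than dash three
    5: Dash(5, 0.5, True),   # too narrow
}


def predicates_for(series):
    """Predicates the next logical subject in the series must satisfy."""
    last = series[-1]
    return {
        "longer than dash three": lambda d: d.length > last.length,
        "horizontal": lambda d: d.horizontal,
        "same width": lambda d: d.width == last.width,
    }


def failed_predicates(dash, series):
    """Names of the predicates that are negated for this alternative."""
    return [name for name, holds in predicates_for(series).items()
            if not holds(dash)]


surviving = [n for n, d in choices.items() if not failed_predicates(d, series)]
print(surviving)
```

Whatever form the examinee's internal representations take, the sketch makes plain that some subject-predicate pairing, and some negation of predicates, has to be expressed somewhere for the elimination to go through.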

While it might be correct to argue that not all of the
propositions contained in the preceding remarks are
necessary to the solution of the series problem, can it
be argued that none of them is required? On the
contrary, would it not make sense to suppose that many
textual forms may be necessary which we have not
highlighted in the previous analysis? Moreover, is it not
also probable that many if not all of the textual
structures illustrated will be required for the solution of
any such problem?

The second type of item in Figure 1 seems to be
somewhat simpler than the first. The directions tell the
examinee to indicate which of the pictured elements
does not go with the rest. Which one does not fit?
Again, the dashes, it would seem, are logical subjects
with which certain predicates must be associated. In
this example, the predicate of horizontal orientation is
necessary to the solution. The propositional operation
of negation is equally essential. The problem is to find
which element does not share the predicate(s) shared
by all of the others. There may be some other way to
represent the problem, but is there any method of
representation which does not implicitly contain the
propositional complexities of predication and negation?
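
A minimal Python sketch of the same point follows. It assumes, purely for illustration, that each pictured element has already been described by its orientation; the function name odd_one_out and the sample data are hypothetical and are not taken from the Cattell materials.

```python
from collections import Counter


def odd_one_out(elements, predicate):
    """Return the position of the single element whose predicate value
    differs from that of all the others, or None if there is no such
    element."""
    values = [predicate(e) for e in elements]
    counts = Counter(values)
    rare = [value for value, count in counts.items() if count == 1]
    if len(counts) != 2 or len(rare) != 1:
        return None
    return values.index(rare[0])


# Five hypothetical dashes, described only by orientation.
orientations = ["horizontal", "horizontal", "vertical",
                "horizontal", "horizontal"]

position = odd_one_out(orientations, lambda o: o == "horizontal")
print(position)
```

Even in so small a sketch, both a predicate (here, horizontal orientation) and its negation (the one value that differs) have to be represented explicitly.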

In the third type of problem, labeled "Matrices," Cattell
was obviously influenced by the matrix format of the
Raven nonverbal IQ test. The problem is to select the
element from the several choices that best completes
the matrix pattern. The matrix type of item is potentially
a much more complex version of the "Series" type. With
a matrix, there are more dimensions of pattern variation
and hence a greater number of propositional structures
are possible. In the item at hand, on the third row of
Figure 1, the test taker may note that there are various
progressions in the matrix which may help to determine
just what the missing element must be like. There is in
fact a progression from left to right in each row, and
from top to bottom in each column. There is even a
progression across the diagonals that may be of some
help. The question, it would seem, is what predicates
must be associated with each of the elements of the
pattern in order to uniquely determine the missing
element of the whole pattern? The order in which the
progressions are noticed may not be too important, but
isn't the fact that at least some of them must be noticed
undeniable?

In the rather simple problem under consideration, it is
sufficient to notice that the two patterns on the left side
of the matrix are similar both in the direction of their
cross-hatched lines, and in the number of those lines.
From this it may be inferred that the ones on the right
must also be similar, and that therefore the correct
choice is the one which matches the pattern in the top
right hand corner of the matrix. Thus, choice 1 is the
correct answer. Other strategies would lead to the same
result, but is there any conceivable strategy that does
not depend on complex textual operations? That is, is it
possible to solve such problems without performing
mental operations that set up subject-predicate
relations which are true of the patterns in question? Or
is it possible to avoid the negation of certain predicates
or the conjoining of textual meanings in complex ways?

In the fourth type of item, "Conditions," the task is more
complicated than in the previous items. The problem is
to note the relation(s) between the dot and the various
other elements enclosed in the figure to the left and
then find the figure on the right where the dot can be
placed in the same relation(s) to similar elements. In the
example item, the dot is inside the circle but outside the
square. The only place where it can be placed in a
similar relationship to a circle and square on the right is
in Choice 3. In all of the other cases, the circle is
completely contained within the square, and it is not
possible to place the dot inside the circle without also
placing it inside the square at the same time.

As with previous examples, we set aside the question of 
how the examinee represents the fact that the larger
square is not to be considered part of the immediate
problem, and all of the other propositions that are
contained implicitly or explicitly in the instructions to
items of this type. Nevertheless, it seems necessary for
the problem solver to employ a vocabulary (perhaps in
Basic) specifying such elements as DOT, CIRCLE, and
SQUARE. It may not be necessary for these elements to
be represented in the words of any particular natural 
language, but isn't a verbal representation highly likely? 

Roth's research (1978) seems to support the conclusion 
that normal children do in fact use verbal
representations for such problems. But what if they did
not? What if the representations were entirely visual or
in some other mental form, and not in words associated
with a particular natural language? Would this
eliminate the necessity of textual operations? 

There must also, it seems, be an implicit (or explicit)
vocabulary to distinguish the predicates INSIDE, NOT
INSIDE (or OUTSIDE), and no doubt others as the
problems become more complex. Moreover, there are
implicit superordinate operations such as NOT and
AND. To solve the problem, the examinee, it would
seem, must be able in some manner or other to
represent internally propositions of the form DOT
INSIDE CIRCLE, as well as DOT NOT INSIDE SQUARE.
Moreover, these propositions must be conjoined by
some sort of AND operator to create a proposition that
has the rough form DOT INSIDE CIRCLE AND DOT NOT
INSIDE SQUARE.
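
As a rough illustration only, the propositional form just described can be written out as a short Python sketch. Everything named here is an assumption introduced for the illustration: the coordinate layouts, the class names Circle and Square, and the helper functions do not come from the test item or from Figure 1. The sketch shows the predicate INSIDE, the operators NOT and AND, and why a choice in which the circle lies wholly within the square leaves no place for the dot.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Circle:
    cx: float  # center x (hypothetical coordinates)
    cy: float  # center y
    r: float   # radius


@dataclass(frozen=True)
class Square:
    x: float     # lower-left corner x (hypothetical coordinates)
    y: float     # lower-left corner y
    side: float  # side length, axis-aligned


def inside_circle(dot, c):
    """Predicate INSIDE for a dot (x, y) and a circle."""
    return hypot(dot[0] - c.cx, dot[1] - c.cy) < c.r


def inside_square(dot, s):
    """Predicate INSIDE for a dot (x, y) and an axis-aligned square."""
    return s.x < dot[0] < s.x + s.side and s.y < dot[1] < s.y + s.side


def condition(dot, c, s):
    """DOT INSIDE CIRCLE AND DOT NOT INSIDE SQUARE."""
    return inside_circle(dot, c) and not inside_square(dot, s)


def circle_within_square(c, s):
    """True when the whole circle lies within the square."""
    return (s.x <= c.cx - c.r and c.cx + c.r <= s.x + s.side
            and s.y <= c.cy - c.r and c.cy + c.r <= s.y + s.side)


def admits_dot(c, s):
    """A choice admits the dot only if some position satisfies the condition."""
    return not circle_within_square(c, s)


# Hypothetical layouts: in one the circle overlaps the square,
# in the other it is completely contained within the square.
overlapping = (Circle(0.0, 0.0, 2.0), Square(1.0, -1.0, 3.0))
contained = (Circle(2.0, 2.0, 1.0), Square(0.0, 0.0, 4.0))

print(condition((-1.0, 0.0), *overlapping))               # True
print(admits_dot(*overlapping), admits_dot(*contained))   # True False
```

Whether an examinee represents the problem in anything like this form is, of course, exactly the open question; the sketch only makes the conjoined proposition explicit.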


Such textual forms have to be fitted to the visually
presented facts. Can we say that they must be
pragmatically mapped onto those facts? Further, is it
not reasonable to infer that the actual pragmatic
mappings must be much more complex than the ones
sketched out here?

Verbal IQ Items. Some may argue that any analysis of
"verbal" IQ items will be trivially relevant to the
hypothesis that intelligence may be based in a kind of
language system. However, there are some good
reasons for looking closely at "verbal" IQ items. For one,
if the language hypothesis is correct, it should be
possible to demonstrate that in important ways "verbal"
items are similar to "nonverbal" items in the textual
operations that they require. For another, it should also
be possible to examine the extent to which such items
are distinct from so-called "language proficiency" items.

"Verbal" IQ items are, as Gunnarsson (1978) showed, 
sometimes excessively difficult to distinguish from
items appearing in so-called "Reading" tests, and other
categories of "Achievement" tests. Almost always,
however, IQ items of the "verbal" type require the
interpretation and/or production of words, or
statements in a particular language. To this extent, it is
already difficult to follow the logic that Jensen,
Herrnstein, and others do in claiming that such tests
primarily measure innate ability, rather than acquired
language proficiency. It was the purpose of Language in
Education: Testing the Tests (Oller and Perkins, 1978)
to question that logic. In spite of some legitimate
criticisms of the statistical methods used in that work, it
seems that the objections raised there against certain
claims of the IQ testers remain largely unanswered.

At any rate, just what kinds of mental operations are 
required by IQ items of the "verbal" type? A typical item, 
one which appeared in the original Binet tests, requires 
the examinee to say what is wrong with a given
statement such as the following:

The judge told the prisoner: "You are to be hanged
at dawn. Let this be a warning to you."

The correct answer is any statement to the effect that,
"He can't be warned if he is dead." The crux of the
problem is a conflict between the necessary
consequence that hanging will kill the prisoner and the
equally necessary fact that warning implies an
opportunity to repeat whatever offense the prisoner has
committed. Most adults immediately laugh upon
hearing the item. But it costs a little thinking to explain
why, and even more effort to see the extensive textual
fabric on which our conclusion depends.

We may differentiate the network of inferentially
related propositions into three categories: first, there
are presuppositions related to prior events or states of
affairs which are crucial if we are to associate any
meaning at all with the asserted propositions; second,
there are associated propositions which aid our
comprehension, but which are neither before nor after
the stated assertions; and, third, there are implications
which are in some sense subsequent to (or consequent
on) the asserted propositions.

For instance, because the judge hands down a death 
sentence, we may infer that the prisoner has been
convicted of a capital crime. This inference is not a
necessary one, but it is a logical presupposition which
normal adults will accept as relevant and probably
correct. A more obviously necessary presupposition is
that the prisoner has been convicted of committing
some serious crime.

Associated textual meanings would have to include the
fact that a prisoner is someone who is detained against
his will in a societal institution known as a prison; that a
judge is someone empowered by the society with
certain authorities within its legal system. In fact, the
presupposed propositions may also have associated
propositions. For instance, the presupposition that the
trial of the prisoner has already occurred when the
sentence is being handed down is linked to the
presupposition that the trial occurred in a courtroom,
and that the verdict was probably reached by a jury.
Associated with the sentence of hanging is the
proposition that it involves placing a rope around the
victim's neck, with the victim's hands restrained, and
then suspending the victim by the rope so that he will
strangle or die of a broken neck. Implied propositional
meanings would include the consequence that hanging
kills the victim and that a warning implies an
opportunity to repeat the previous offense.

No doubt the foregoing analysis leaves out much more
of the textual network of associated meanings than it
makes explicit. However, it does suggest that an IQ item
of the type analyzed involves a good deal of textual
reasoning of the very sort that goes on in the
interpretation of ordinary discourse (for more on this
topic, see Rumelhart, 1976; and Schank and Abelson,
1977). Is it not an open question whether there are any
meanings at all which are not linked to the fabric of
experience via the sort of textual network illustrated
with respect to the hanging example?

6 For those criticisms, see the various contributions to Issues in Language Testing Research (Oller, in press). Also see especially Chapter 22 of that

Here is another example from the Binet-type verbal IQ
test. What is wrong with the statement? 

In a train wreck, the cars at the end are usually the 
ones most severely damaged.

Surprisingly, perhaps, most adults not only do not laugh 
at this item (in contrast to their reaction to the one
about the hanging), but in fact, they meet it with a sort 
of blank stare. Many do not see anything wrong with it 
at all on first hearing. However, a bit of thinking will 
show what the test authors had in mind. 

To solve the puzzle, it is necessary to consider what 
usually happens in a train wreck, and then to ask 
whether it is likely that the cars in the back will be the 
ones to receive most of the damage. 

In fact, trains travel on tracks. This is an associated
proposition. That is, it is associated with the assertion
about trains. Ordinarily, they travel in a forward linear
direction. This too is an associated proposition which is 
a direct consequence of the linear nature of the tracks. 
What is more, it may be helpful in solving the problem. 
From this it can be inferred that the cars at the front are 
the ones which will logically receive the greater force of 
an impact if there is a collision. 

This inference is an implication (by the definition given
above) of the nature of trains and their usual linear
motion in a forward direction. A further implication is
that the cars at the back are therefore the least likely to
be damaged, since they are the ones which are the
most protected due to the cushioning effect of the
intervening cars. 

Of course, it is true that the cars at the end will receive
the greatest impact if a stationary train is struck by
another train from behind. However, the one striking it
will tend to receive the most damage in its lead cars.
And, in any case, to reason that the statement about
damage to trains is correct or to reason on such a basis
that it is incorrect demonstrates in both instances the
need to map propositions into the episodic facts of
experience. As in the hanging example, the problem
solver requires access to much of the textually
interpreted fabric of experience. There may be a
multitude of strategies for working out these 
relationships, but this does not deny the importance of 
inferential access to them. 

Here is one final example. In another typical item type,
the examinee is presented with certain asserted
information and then is asked to answer a question
based on that information. For instance, the examinee
might be told:

Mary is taller than George but she is shorter than 
Sam or Harry. Who is the shortest? 

Just what sort of mental computations are necessary to
the solution of this problem? It seems simple enough on
the surface, but there is more to it than immediately 
greets the analyst. 



Consider first how a solution might be achieved
through a visual strategy. Suppose we first imagine a
female named Mary who is visibly taller than some male
named George. (At this point, two propositional
inferences about sex which go beyond the asserted
information have already been made. We have also
arrived at the proposition that both Mary and George
are humans. It is difficult to imagine how such
inferential meanings might be derived from a strictly
visual strategy, but let us continue.) Then, we may
simply add two other male persons to our visual image,
namely Sam and Harry. (Again, sex assignment is based
on propositional inferences about the likely values of
the two names in question.) It is asserted that they are
both taller than Mary. (However, a slight difficulty again
arises for a strictly visual strategy since Sam and Harry
are not compared with each other. How can we know
for certain what height to assign to either of them?
Though it makes no difference to the problem at hand,
height assignment to Harry and Sam would seem to be
necessary to placing them in our visual image. Let's
assume this is accomplished in some manner or other.)
To answer the question, "Who is shortest?", we may
now simply scan the image and pick out George.

Or, it would seem, a quicker nonvisual strategy would 
be to reason that since George is shorter than Mary and 
Mary is shorter than Sam or Harry, George must be 
shorter than Sam or Harry, and therefore, the shortest of
the four. This inference works because height is a
transitive relationship.
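
The nonvisual strategy can be sketched, under stated assumptions, as a small Python program. The function names and the encoding of each assertion as a (shorter, taller) pair are choices made here for illustration, not anything specified by the test. The sketch chains the asserted propositions by transitivity and then looks for the one person inferred to be shorter than all the others.

```python
from itertools import product


def transitive_closure(pairs):
    """Return every (a, b) derivable by chaining a < b and b < c."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow safely.
        for (a, b), (c, d) in product(tuple(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure


def shortest(pairs):
    """Return the person inferred to be shorter than everyone else, or None."""
    people = {p for pair in pairs for p in pair}
    closure = transitive_closure(pairs)
    for person in people:
        if all((person, other) in closure for other in people - {person}):
            return person
    return None


# Asserted propositions, each read as (shorter, taller).
asserted = {("George", "Mary"), ("Mary", "Sam"), ("Mary", "Harry")}

print(shortest(asserted))                                 # George
print(("Sam", "Harry") in transitive_closure(asserted))   # False
```

The second line of output reflects the point made above about the visual strategy: nothing in the asserted information orders Sam and Harry relative to each other, yet the question asked can still be answered.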

No doubt, there are other strategies that will arrive at 
the same correct solution, but are there any that do not
rely heavily on inferred propositional meanings? In fact,
isn't it helpful (perhaps even necessary for some
problems) to know what the question is in advance of
beginning to interpret the relevant facts? Otherwise,
how is it possible to know what to do with the asserted
propositional meanings? At least some problem solvers
report that they need to have another pass at the
asserted meanings after they know what it is they are
looking for. For instance, consider how the problem
would be changed if the question were, "Who is the
next to the shortest person?", or "The third from the
tallest?" Doesn't the need to know what the question is
suggest that comprehension itself needs to be guided
by some sort of propositionally formulated plan? (This
would also appear to hold for the nonverbal item types
discussed above.)

In addition to the reasoning already sketched out, there
are some other textual inferences that are lurking in the
background just out of view. For instance, we assume
that the measurement of the persons in the given
assertions is done with all of them in a standing
position, legs straight, head erect, etc. Further, we
assume that hats don't count, that shoes are taken off,
and that it is unfair for anyone to stand on their tip-toes.
We do not expect the measurement to be done with
some of the persons in kneeling or sitting position while
others are standing, and so forth. Beyond these
presupposed meanings there must be a great deal of
additional textual structure.



In the light of even this brief analysis, does it not seem
clear that the solution of both "verbal" and "nonverbal"
items may be dependent on the utilization of
propositional meanings of considerable complexity? Is
it not clear that presuppositions, associations, and
implications must be supplied by the problem solver
through abstract inferential reasoning? Further, doesn't
all of this entail the utilization of some deep language 
system which has grammatical capabilities allowing 
such operations as predication, negation, conjunction, 
modification, and the like? 

Educational Implications

With the three "what-if" premises in mind that were
stated earlier, let us return to the question in the title.
Are we testing for intelligence or for language?
Suppose that some universal Basic language really
were the foundation of intelligence. Further, suppose
that the interaction between this system and the
progressive acquisition of one or more particular
language systems were crucial to the accessibility and
maturation of the deeper Basic system. From such a
vantage point, it might well be that "intelligence" tests
of all sorts are actually measures of the ability to
equilibrate complex texts in relation to given facts.
Human intelligence may achieve its marvelous feats of
understanding by translation back and forth between
the texts of a particular language and representations in
some deeper Basic language system (perhaps rendered
as Gestalts at some level). Within this framework,
adaptation, assimilation, and accommodation could all
be defined as processes of fitting potential
propositional meanings to the facts of experience
(which themselves are already expressed in a kind of
language system).

In the final chapter of The Human Brain, Wittrock
(1977) contends that "imagery" is crucial to the
acquisition of "verbally presented" content. He cites the
work of Allen Paivio and others showing that creating a
meaningful image seems to have a direct and highly
positive impact on retention and recall in many types of
learning tasks. Could this be because of the fact that the
creation of the appropriate sorts of images engages the 
main text generating machinery of human intelligence? 
According to the framework developed here, 
meaningfulness may depend on the linking of 
propositional meanings in one form with similar 
meanings expressed in different forms. It suggests that 
perhaps consciousness is dependent on translation or 
movement from one language system to another. 

Paivio and others have demonstrated that the recall of a 
verbally presented form may be enhanced by causing 
the subject to link it up with a visual image that
somehow expresses the same (or a similar) meaning.
Could it be that the linking itself is the principal factor?
Moreover, is it possible that this linking facilitates 
retention and recall because it requires an active fitting 
of textual meanings to each other? 



In two dramatic demonstrations, Keislar and McNeil 
(1962) and Wittrock (1963) showed that children who 
dealt with iconic presentation of various facts of 
molecular kinetics succeeded surprisingly well in 
recalling the principles a year later. In Wittrock's study,
two-thirds of the children were able to remember the
criterial concepts a year later. It was surprising that they
were able to do this since they had only received two to
four weeks of instruction. More importantly, they were
believed to be too young to learn the concepts in
question in the first place.

Could the translation of texts in one language (Basic) to 
texts in another language (e.g., English or whatever) be 
the essence of intelligent action? Is experience textual
in nature? Can it be understood without inferential
operations that relate incoming facts to precedents and
consequences? What if intelligence were language-based
after all?

ADDITIVE VERSUS SUBTRACTIVE FORMS OF BILINGUALISM:
CONFUSION REGARDING PROGRAMS OF IMMERSION

Wallace E. Lambert
McGill University 



Introductory Overview

Immersion programs for teaching second languages
and developing bilingual skills were initiated in public
schools in Canada some fifteen years ago. Since then,
their development, modification and implementation
have been described in technical journals along with an
accumulation of evaluative studies of their impact on
pupils who proceed from one year in the program to
another. Information about the programs and their
surprisingly favorable outcomes has caught the
attention of educators in various nations, especially in
the USA. In order to assess their possible applicability
to the USA educational scene, it is important that a
clear description of a typical "IMMERSION" program be
widely available, along with differentiations between
"immersion," "submersion," and "second-language-teaching"
approaches. This is one aim of the present
paper.



The second is to highlight the fact that immersion
programs were devised basically for native
English-speaking pupils in Canada or in the U.S.A. who
are certain that their development of skills in the
English language would never be left in jeopardy, either
by the educators in charge of the immersion programs
or by the home, community and media environments in
which they lived. With the assurance of a solid
English-language linguistic environment, the immersion
approach exposes pupils to as much of the second
language as possible (short of living in a foreign setting)
while monitoring its effects and the effects of having
little or no home-language instruction on children in the
early elementary years of schooling. The programs were
explicitly not meant for Canadian children with some
language other than English as the home language (in
particular not those who have French as the home
language) who, one might expect, could profit as well
from a "reversed immersion" program where they would
encounter only or mainly English language instruction.
This reversal was painstakingly avoided because of a
well-grounded fear on the developers' part that in North
America (Canada and Quebec included), French or any
other home language other than English is vulnerable to
neglect and replacement. Thus, a French-speaking
Canadian in a reversed (i.e., English-based) immersion
program might move towards bilingualism for a short
time, but basically he/she would be starting a slow
"subtraction" of French and its replacement by English
which, in the North American setting, can too easily be
viewed as a more useful, prestigious or otherwise more
"valuable" language. The cognitive and educational
difficulties associated with this subtraction/replacement
process need to be understood.



Accordingly, attention will be directed to a distinction 
that has been made, on the basis of available research 
studies, between "additive" and "subtractive" forms of 
bilingualism and the confusions that would arise if
immersion programs, which were designed to be
"additive," were reversed and inadvertently made
"subtractive." Potential solutions to the problem of
linguistic and cultural subtraction, in the case of
ethnolinguistic minorities, are available.

The Origins, the Nature, and the Outcomes of
Immersion Programs 

Origins. It is important to understand the origins of 
immersion programs in the Canadian setting because 
they are much more social-psychological than they are 
linguistic or educational at their cores. Basically,
immersion education started because of the desire on
the part of the original group of parents, educators and
researchers, who got the first experimental programs
underway, to make Canada a more fair and more
interesting society for both "founding peoples,"
French-speaking Canadians and English-speaking
Canadians. Although Canadian in content, this
development is pertinent to American society because
similar social processes run their courses in both
settings. In Canada these are more visible because of
socio-political movements towards independence or
separation on the part of the French Canadians.

Although there are many Canadian/American parallels, 
there are still important differences. For instance, 
Canada's constitution has clear provisions for the 
protection of the language and culture of both French-
and English-speaking subgroups, and although the
government has a policy favoring multiculturalism, it
does not provide extended support for education being
conducted in any of the numerous other home
languages spoken in Canada. Since World War II,
non-English speaking immigrants make up a sizeable
proportion of Canada's population. To its great credit,
the USA has federal laws requiring educational help
(involving teaching via the home language of pupils)
for all non-English speaking ethnic groups who, it is
recognized, are placed at a disadvantage in schools and
in occupations that presume native competence in
English. However, the USA shows no signs of
recognizing or appreciating the de facto bilingual
character of contemporary America which has nearly as
many families with Spanish as the home language as
there are people in the total population of Canada. And
the English-Spanish bilingual character of contemporary
America is only one strain, for there are various
other equally vital ethnolinguistic groups, each
contributing to a fascinating multicultural American
society. There is, then, much more to be done to
capitalize on this ethnolinguistic richness in both
America and Canada.

French-speaking Canadians have had a long history of 
finding themselves second-class citizens in a social 
world which has reinforced Anglo-American values and
the English language. The second-class status showed
itself in the form of French-speaking Canadians playing
subordinate roles to English Canadians, the dominant
subgroup in Canadian society, comparable to
English-speaking whites in the USA. Not only have
French Canadians been grossly underrepresented in the
upper levels of Canada-wide status hierarchies, but
even in the Province of Quebec, where they constitute
some 80 percent of the population, French Canadians
have not, relative to English Canadians, made it
occupationally or economically, and their style of life
has been ignored, ridiculed and blamed as the cause of
their social and economic position. The trouble is that
this type of thinking becomes contagious and over time
even members of the marked minority group begin to
believe they are inferior in some sense and blame
themselves for their inferiority (see Lambert, 1967). It
takes much reflection in frustrating situations of this
sort to see through the sophistry and realize that one's
ethnic or social-class group is in no way inherently
inferior, but simply that those with the power
advantages have learned well how to keep the
advantages and that their social-class cushion makes
keeping power relatively easy for them. Stereotyping or
otherwise marking minority groups (people they
really know very little about) becomes an effective
way for the majority group to keep others out of the
power sphere.

As social psychologists, we began to study this state of
affairs in Canada some 25 years ago just as two extreme
solutions to the "French Canadian problem" were
coming into vogue: 1) French Canadians should pull up
their socks and compete, meaning they should
master English and Anglo-American ways while
toning down their French Canadian-ness; 2) French
Canadians should pull apart or separate, meaning
they should form a new independent nation where they
could be masters of their own fate and where the
French Canadian language and culture could be
protected. Both alternatives worried us because one
meant giving up a style of life that was precious, and
the other meant closing a society through separation,
"closing" in the sense that Karl Popper (1966) uses the
term in describing socio-political attempts to create a
conflict-free subworld where the "good old ways" will
be protected. Instead we viewed the French Canadian
way of life as something valuable for Canada as a whole
(a nation whose potential and fascination rest in its
multicultural/multilingual makeup), whether or not it
was appreciated as such by the majority of English or 
French Canadians. 

The Nature of Immersion Schooling. So we became
interested in reducing, if possible, the ignorance of
French Canadian-ness and in enhancing an appreciation
for it among Anglo-Canadian children. This then
became the guiding purpose for the research initiated at
McGill on "early immersion" schooling (see Lambert &
Tucker, 1972; Swain, 1974; Genesee, 1978-79),
wherein English-speaking children, with no French-language
experience in their homes and little if any in
their communities, enter public school kindergarten or
grade 1 classes that are conducted by a monolingual
French-speaking teacher. This "early immersion" or
"home-to-school language-switch" program, as we call
it, is kept exclusively French through grade 2 and only
at grade 2 or 3 is English introduced, in the form of a
language arts program, for one period a day. By grade 4,
particular subject matters are taught in English (by a
separate English-speaking teacher) so that by grades 5
and 6, some 60% of instruction is conducted in English
(see Lambert, 1979).

The concept of immersion schooling was based on a
very important and fundamental premise: that people
learn a second or third language in the same way they 
learn their first and that languages are best learned in 
contexts where the person is socially stimulated to 
learn the language and is exposed to it in its natural 
form. 

From the first encounter, the immersion teacher uses
only the target language. She clearly, patiently and
repetitively focuses on the development of a basic
vocabulary in the new language, relying, with young
children, on plastic art materials, songs and animated
stories. But from the start, the learning of language per
se is made quite incidental to learning how to do new
and interesting things, with the new language as the
verbal accompaniment. Later, new ideas of a scientific,
mathematical or problem-solving nature are given the
main focus, and again the amassment of skill in the new
language appears to be incidental, except for short daily
periods of language arts which focus on the new
language itself (see Lambert & Tucker, 1972). The
teachers' aim is to cover fully the content matters
expected of any child in a conventional program at that
grade level. 

Immersion classes normally comprise only Anglophone 
pupils; thus, the whole class experiences immersion as
a group. In some cases, a few children who are native
speakers of the target language are introduced into the
otherwise Anglophone class, and their presence can be
useful in many ways. However, one has to choose
carefully those who are to represent the other ethnic
group; a biased selection could easily upset the
development of positive attitudes toward the other
group (see Lambert, 1982).

We refer to situations where a very few Anglophone 
children are placed in an otherwise all French language 
class (one comprised of Francophone pupils only) as
"submersion" instead of immersion. The chances for
more rapid development of expressive language skills in
the foreign language are obvious in submersion in
comparison to immersion programs, but being few and
being different might present social adjustment
problems not encountered in immersion classes.

Immersion and submersion programs both focus on
subject-matter mastery and make language learning
incidental, and this similarity makes both quite different
from second-language teaching programs (e.g.,
French-as-a-second-language) where the new language is the
focus, where content-matter mastery is not a main goal
and where only small amounts of time are devoted to
the second-language component. That component is
also usually covered by a specialist, rather than the
grade teacher. Thus, immersion programs are much
more intense and comprehensive than second-language
programs, and since no specialists are
involved, the costs of immersion programs are hardly
any different from normal costs since the class teacher
is also the language specialist and the class size (e.g.,
30-32 pupils to a teacher in Canada) is usually kept
normal.

Immersion differs from typical "bilingual education" 
programs (as these are conducted in North America)
because no bilingual skills are required of the teacher
(who presents herself as a monolingual in the target
language) and who, therefore, never switches
languages, reviews materials in the other language, or
otherwise encourages bilingualism in the teacher-pupil
interactions. In immersion programs, bilingualism is
therefore developed through monolingual instructional
routes. 




Outcomes: Educational and Cognitive. The consistent 
findings from fifteen years of careful research on
children in immersion programs permit several
conclusions which bear not only on the linguistic
consequences of these programs, but the psychological
and social consequences as well. 1) Immersion pupils
are taken along by monolingual teachers to a level of
functional bilingualism that could not be duplicated in
any other fashion short of living and being schooled in a
foreign setting. Furthermore, pupils arrive at that level
of competence 2) without detriment to home-language
skill development; 3) without falling behind in the
all-important content areas of the curriculum,
indicating that the incidental acquisition of French does
not distract the students from learning new and
complex ideas; 4) without any form of mental confusion
or loss of normal cognitive growth; and 5) without a
loss of identity or appreciation for their own ethnicity.
Most important of all in the present context, 6) they also
develop a deeper appreciation for French Canadians
and a more balanced outlook towards them by having
learned about them and their culture through their
teachers and through their developing skill with the
language of French Canadians.

Instructive illustrations of the effect of immersion
schooling on students' cognitive development are now
available, even though, because their sample sizes are
small, they should be viewed as suggestive, rather than
definitive at this point. One is a study by Scott (1973) in
Montreal, who compared two subgroupings of children
who were in all respects a single homogeneous group
until grade 1, at which time one subgroup entered an
immersion program, while the second subgroup of
closely comparable youngsters had not been given the
immersion opportunity. Scott worked with data
collected over a seven-year period from two groups of
English-Canadian children, one which had become
functionally bilingual in French during the time period
through "immersion schooling" in French, while the
second group had followed a conventional
English-language education program. Scott focused on
the possible effects that becoming bilingual might have
on "divergent thinking," a special type of cognitive
flexibility (see Guilford, 1950, 1956). Measures of
divergent thinking provide subjects (in this case,
children) with a starting point for thought - "think of a
paper clip" - and then ask them to generate a whole
series of permissible solutions - "tell me all the things
one could do with it." Some researchers have
considered divergent thinking as an index of creativity
(e.g., Getzels and Jackson, 1962), or at least an index of
a rich imagination and an ability to scan rapidly a host
of possible solutions. Whatever the term should be, the
results, based on a multivariate analysis, showed that
the functionally bilingual immersion pupils, as of grades
5 and 6, were substantially higher scorers than the
nonimmersion controls with whom they had been
equated for IQ and social class background at the
first-grade level. Although the numbers of children in
each group are small, this study supports a causal link
between bilingualism and flexibility, with bilingualism
one of the factors that seem to enhance flexibility.



Supportive evidence is also found in quite independent
studies of the cognitive effects of immersion programs
on Anglophone Canadian children in other areas of
Canada, namely the work of Barik and Swain (1976) in
Ottawa and Toronto and Cummins (1975, 1976) in
Edmonton, Alberta. Both of these studies show
increases in IQ or in divergent thinking scores that can
be attributed to the development of bilingual skills
transpiring in immersion schooling.

There is, then, an impressive array of evidence
accumulating that argues plainly against the common
sense notion that becoming bilingual - having two
linguistic systems within one's brain - naturally
divides a person's cognitive resources and reduces
his/her efficiency of thought and/or language. Instead,
one can now put forth a very strong argument that
there are definite cognitive, educational and social
advantages to being bilingual. These advantages are
experienced as much by children from working class
socioeconomic backgrounds as by the more advantaged,
and by children with various levels of measured IQ,
including children with diagnosed learning difficulties.

Outcomes: Sociocultural Awareness. What is exciting
about this program, over and above its educational and
cognitive impact, is that it opens children's minds to an
otherwise foreign and possibly threatening outgroup. It
also provides certain sociopolitical insights that
monolingual mainstreamers would likely never have.
For example, the immersion children come to the
realization that peaceful democratic coexistence among
members of distinctive ethnolinguistic groups calls for
something more than simply learning one another's
languages (see Blake, Lambert, Sidoti, & Wolfe, 1981;
Cziko, Lambert, Sidoti, & Tucker, 1979). Having learned
the other language well and having learned to
appreciate the other cultural group, children with
immersion experience, compared to controls, realize that
effective and peaceful coexistence calls as well for
opportunities for both ethnic groups of young people to
interact socially on an equitable basis. This is a very
sophisticated insight.

Immersion Programs in the USA 

Thus, a new approach to bilingual education is now
available, and since it works as well in other parts of
Canada where few if any French Canadians are
encountered in social life (see Swain, 1974), it or some
variation of it can be expected to work equally well in
the USA. In fact, there are currently some ten or more
communities in the USA where comparable early
immersion programs for mainstream English-speaking
children are underway (in Spanish, French and German,
so far), and from all available accounts, they are working
splendidly (see Cohen, 1976; Samuels & Griffore, 1979;
Montgomery County Public Schools, 1976; Derrick,
1980; Grittner, 1981; O'Connell, 1981; Sidoti, 1981).
Part of the reason for their success is that school
administrators and principals, after an initial period of
skepticism and wariness, become extremely pleased
and proud of the outcomes. Furthermore, the costs of
the programs are surprisingly low compared to
second-language-teaching programs because the
regular teachers' salaries go to the new "foreign
speaking" teachers.


But what really counts as success is the pride and
progress reflected by teachers, parents and pupils. For
example, Frank Grittner, the Supervisor of Second
Language Education for the State of Wisconsin, has
collected data on third grade English-speaking children
(few with German ethnic backgrounds) in a German
immersion program where they were taught through
German for three years. That particular immersion
program was related to a plan for desegregation, and
thus, some 40% of the pupils involved were black. At
the end of grade 3, 100 percent of the German
immersion pupils scored in the average to above
average range on the Metropolitan Achievement Test
for Reading (in English) compared to 70 percent for
Milwaukee schools in general and 77 percent for U.S.
norm groups. Likewise for Mathematics Test scores
(also tested through English), the respective averages
were 92, 71 and 77 percent. Similar outcomes are
available for English-speaking American children in a
French immersion program in Holliston, Massachusetts,
as of the end of grade 2 (O'Connell, 1981). In New York
City, where an interesting program of partial immersion
in Spanish for English-speaking pupils has been tried
out, the end-of-year parental responses and evaluations
are extremely favorable. What characterizes the
parents' reaction is the delight they show that their
children are learning about Spanish-speaking people
and developing an appreciation for them at the same
time as they acquire the basics of the language. They
are pleased not for "instrumental" reasons but for
"integrative" ones, i.e., that intergroup harmony is
initiated, not that their children can profit in the
business world by knowing Spanish (Sidoti, 1981).

Furthermore, there is strong evidence to show that
monolingual Anglo-Canadian children can handle easily
a "double immersion" program wherein two totally
unknown languages (in this case French and Hebrew)
are used as the main languages of instruction from
kindergarten through the elementary grades (see
Genesee & Lambert, 1982). Incidentally, the striking
success of these double-immersion programs in
Montreal schools makes one think twice about Canadian
policy makers who give verbal support to promote
multiculturalism, but stop short of providing at least
some instruction via home languages. The point is that
ethnic minorities in Canada might easily handle and
enjoy education that is trilingual - French, English and
home languages - just as the Jewish children in the
double immersion programs not only manage, but also
enjoy education that is French, Hebrew and English.

The variants of the immersion program that might be
valuable and relevant when applied in the USA are
limited only by one's imagination. For instance, the New
York City variant is a partial immersion program that
can be increased in time devoted and in scope to satisfy
large numbers of pupils with a variety of language
options (see Sidoti, 1981). Then there is an extremely
interesting "Language to Share" program, wherein
ethnic minority adolescents are trained to be junior
teachers of their home language to pupils two or three
years younger than themselves. Similarly, there are
possibilities for "language exchange" programs
(Lambert, 1978b), wherein speakers of English, for
example, who are interested in learning particular
foreign languages are paired up with peers who have
those foreign languages as their home languages and
who exchange two or so hours per week in teaching
their home language informally while receiving English
instruction in return. The exchanges are coordinated by
a small group of master teachers.

What this all means is that there is now available an
effective means of developing a functionally bilingual
citizenry. The evidence, both scientific and anecdotal, is
consistent enough that many of us are confident that
such programs will slowly but surely be implemented in
the USA and other major sites. Our concerns now shift
to another domain, i.e., that immersion programs will be
instigated without sufficient attention given to certain
social-psychological processes that come into play
simultaneously with language learning. Such processes
are at the base of what we refer to as "additive versus
subtractive" forms of bilingualism.

Additive Versus Subtractive Forms of Bilingualism 

In the descriptions just given, it is important to keep in
mind which segments of the Canadian and American
societies have played the major role in immersion
programs. In each setting, it has been the
English-speaking Canadian and the English-speaking
American mainstream families - those segments of
the respective societies that are most secure in their
own ethnic and linguistic identity, but the ones most in
need of knowledge about and appreciation for other
ethnic and linguistic groups. To the extent that
mainstream children are sensitized to and educated in
other languages and cultures, the better the chances
are of developing a rich, harmonious, pluralistic society.
The better too are the chances of improving the
self-views of ethnolinguistic minority children who are
immensely heartened and complimented when they
realize that mainstream children are making sincere
gestures to learn about them, their language, and their
ways of life.

We have referred to this process of developing the
bilingual and bicultural skills of English-speaking
Canadian or American children as an "additive" form of
bilingualism, implying that these children, with no fear
of ethnic/linguistic erosion, can add one or more foreign
languages to their accumulating skills and profit
immensely from the experiences - cognitively,
socially, educationally, and even economically (see
Lambert & Tucker, 1972; Lambert, 1978a). Developing
strong skill in a second, socially relevant language
would enlarge these pupils' repertory of skills, and
rather than detracting from their English home-language
base, their skill with English appears to be
enhanced. For these children and their parents, it
becomes clear that the learning of the second language
in no way portends the slow replacement of the first or
"home" language, as would be the case for most
linguistic minority groups in North America who are
pressured to develop high-level skills in English at the
expense of their home languages. Thus, we refer to
these experiences as examples of "additive"
bilingualism, and we draw a sharp contrast with the
"subtractive" form of bilingualism experienced by
ethnolinguistic minority groups, who, because of
national educational policies and/or social pressures of
various sorts, feel forced to put aside or subtract out
their ethnic languages for a more necessary, useful and
prestigious national language (Lambert, 1974).

In the subtractive case, one's degree of bilinguality at
any point in time would likely reflect a stage in the
gradual disuse of the ethnic home language and a
fading of that language's associated cultural
accompaniments, and their replacement with another
more "necessary" language and a new cultural
accompaniment. This form of bilingualism can be
devastating because it gradually makes the child's
basic conceptual thinking - to the extent that it is
linked with or colored by the home language - no
longer relevant or functional. In a sense, the child is
expected to redevelop basic concepts with a new
linguistic accompaniment, while trying to keep up with
native English-speaking peers. In other words, these
youngsters are placed in a psycholinguistic limbo
where neither language is useful as a tool of thought
and expression, a type of "semi-lingualism," as
Skutnabb-Kangas & Toukomaa (1976) put it. Although
not yet fully understood, this subtractive phenomenon
could become a central issue in the psychology of
bilingualism.



The case of French and English in Montreal is
interesting because both additive and subtractive
features are involved. For Anglophone Quebecers,
learning French is clearly additive in nature, with no fear
of a loss of identity or of French eradicating
English-language competence. Since Francophone
Quebecers comprise some 80 percent of the population
and have their own French-language school system
from kindergarten to the most advanced professional
institutions, learning English might also be thought of
as additive. From a North American perspective,
however, Quebec is a small French-speaking enclave
that is continuously bombarded by English language
media, with pressures on its children to prepare
themselves for life in an otherwise English-speaking
semi-continent. For Francophone Canadians outside
Quebec, the chances of keeping French alive as a home,
school and work language are slim. This fear of a
subtractive loss of Frenchness is real for many
French-speaking Quebecers as well; a too ardent move
toward Englishness might well subtract out Frenchness.

The research of Taylor, Meynard, & Rheault (1977)
indicates how sensitive certain French-speaking
Canadian subgroups are to a possible loss of ethnic
identity when they are either forced or enticed to use
English instead of French as a language of work and/or
a language of thought. Thus it becomes clear how
learning the other group's language through immersion
schooling means one thing for the ethnically and
linguistically comfortable Anglophone Canadian, but a
quite different thing for the ethnically and linguistically
insecure Francophone Canadian. In one case,
immersion-in-French offers numerous advantages to
Anglophones, through an additive process, while in the
other case, immersion-in-English would be a menacing
reversal of immersion schooling, which, through a
subtractive process, could place a social minority
group's language and cultural identity in jeopardy.

It was these examples of different expectations and
worries to be found in Francophone versus Anglophone
communities in Canada that led Lambert and Tucker
(1972) to propose a general guiding principle: "In any
community where there is a widespread desire or need
for a bilingual or multicultural citizenry, priority in early
years of schooling should be given to the language or
languages least likely to be developed otherwise, in
other words, the language(s) most likely to be
neglected" (1972, p. 216). This would mean having both
French- and English-speaking Canadian children start
their schooling in French in Canada, the language more
likely to be neglected or bypassed; this would provide
an additive immersion program for Anglophone
children while protecting Francophone children from
the subtractive drift to English. Once their home
language is established as an active language of
thought and expression, the Francophone children
could gradually start a part-time immersion-in-English
program to assure them an eventual bilinguality,
comparable to that of the young Anglophones learning
French through immersion. Lambert and Tucker believe
that this principle holds for any setting; it is a matter of
determining which language(s) have high status and
utility and which low in the given community, and then
substituting new languages for English and French in
the Canadian example.

Transforming Subtractive Bilingualism into Additive
Bilingualism

The guiding principle just mentioned can be applied in
another manner. One starts with the reasonable
proposition that the major aim of education in North
America should be to brighten the outlook for
ethnolinguistic minority-group children by preparing
them to compete fairly with mainstream children in
educational and occupational pursuits. As potential
bilinguals, they certainly have the cognitive and
linguistic potential, as the research already mentioned
shows. The best way I can see to release that potential
is by transforming their subtractive experiences with
bilingualism and biculturalism into additive ones.

We already have a few research-based examples of
how this transformation might work. The first is the
case of Franco-Americans in northern New England
who recently were given a chance to be schooled partly
in their home language (Dubé and Herbert, 1975a and
1975b; Lambert, Giles and Picard, 1975; Lambert, Giles
and Albert, 1976). Some 85% of families in the northern
regions of Maine have kept French alive as the home
language or as one of the two home languages, even
though traditionally all schooling has been conducted
in English. We participated in an experiment wherein a
random selection of schools and of classes in the area
were permitted to offer about a third of the elementary
curriculum in French and a second sample of schools,
with children of comparable intelligence scores and
socioeconomic backgrounds, served as a control or
comparison in that all their instruction was in English.
After a five-year run, the children in the "partial French"
classes clearly outperformed those in the control
classes in various aspects of English language skills and
in academic content, such as math, learned partly via
French. At the same time, French had become for them
something more than an audio-lingual language
because of the reading and writing requirements of the
French schooling. These results mean that the
French-trained Franco-American children were given a
chance to be fully bilingual and this had repercussions
on their cognitive abilities and bettered their
opportunities to compete in occupations or professions
that call for high-level educational training. They had
been lifted from the typical low standing on scholastic
achievement measures that characterizes so many
ethnolinguistic groups in North America.

An important element in this transformation was the
change in the self-views of the French-trained
youngsters who, we found, began to reflect a deep pride
in being French and a realization that their language
was as important a medium for education as English
(Lambert, Giles and Picard, 1975). Similar
community-based studies are underway in the
American Southwest, and these, too, are based on the
belief that ethnolinguistic minorities need a strong
educational experience in their own languages and
traditions before they can cope in an "all American"
society or before they will want to cope in such a
society.

A second example of a transformation of subtractive to
additive bilingualism is provided by Carolyn Kessler and
Mary Quinn (1980). In their study, Spanish-speaking
Texas grade 6 students were given the opportunity in
elementary school to learn subject matters via Spanish
while learning English, that is, like the Franco-Americans
in the first example, to use their home
language - the language through which their basic
conceptual thinking developed in infancy - as one of
the linguistic media for further conceptual growth. The
Hispanic-American students were compared with a
much more privileged sample of middle class, white
monolingual English-speaking American pupils of the
same age. Both groups were given an extensive training
program in "science inquiry" through films and
discussion of physical science problems and hypothesis
testing. In tests given after the training, it was found
that the Spanish-English bilinguals generated
hypotheses of a much higher quality and complexity
than did the monolinguals. This problem-solving quality
was also reflected in the language used, as indexed by a
"syntactic complexity" measure, so that the bilinguals
were clearly using more complex linguistic structures
as well. The authors also found substantial correlations
between their measures of hypothesis quality and
syntactic complexity, providing thereby an important
link between problem-solving capacity and linguistic
skills.

The research by Kessler and Quinn jibes nicely with
other findings. For example, Padilla and Long (1969)
found that Spanish-American children and adolescents
can acquire English better and adjust more effectively
to the educational and occupational demands of
American society if their linguistic and cultural ties with
the Spanish-speaking world are kept alive and active
from infancy on. There are in fact numerous recent
examples (Hanson, 1979; McConnell, 1980; Rosier &
Holm, 1980; Troike, 1978) that point in the same
direction. G.R. Tucker (1980) recently summarized these
studies and concluded that there is "a cumulative and
positive impact of bilingual education on all youngsters
when they are allowed to remain in bilingual programs
for a period of time greater than two or three or even
five years and when there is an active attempt to
provide nurturance and sustenance of their mother
tongue in addition to introducing teaching via the
language of wider communication" (pp. 5-6).

Conclusions 

A new form of education is developing around
immersion programs, already well known in
Anglophone communities across Canada and now
being scrutinized carefully for possible adoption by
educators, parents and researchers in the USA. The
innovative feature is that we now realize that solid
education and better-than-normal cognitive
development can take place in classes where a second
totally foreign language is used as the major or only
medium of instruction and learning from the initial
school grades, only gradually giving way to
conventional English-language instruction from grade
[illegible] or so on, but nonetheless kept as an active
alternative language of instruction and learning through
high school.

These conclusions hold, without exception as far as we
can determine, only for English-speaking children in
Canada and the USA, the segments of American society
for whom immersion-in-another-language programs
were developed. It is now clear that Anglophone
youngsters can keep up with or surpass control children
in conventional classes in their performance on
curriculum content, and there are no negative effects on
English language development, cognitive development
or general educational attainment. Instead, their English
skills and cognitive growth are strengthened, relatively,
and their ethnic identity is likewise better rooted and
made open to other ethnicities.

All of these advantages are seen as a form of
enrichment derived from the "addition" of highly
advanced skills with a new language and a deep
understanding of a different ethnolinguistic group. The
enrichment is additive because these children have no
fear or worry in North America of losing English as their
basic language of thinking, expressing and problem
solving. Nor need they have worries that they will lose
their American identity in the process.

In contrast, immersion programs were not designed or
meant for ethnolinguistic minority groups in North
America who have some language other than English
as the main home language. To place these children in
an all-English instructional program would be to reverse
the immersion in a harmful subtractive way. Their
personal identities, their early conceptual development,
their chances of competing or succeeding and their
interest in trying to succeed would all be hampered by a
reversed immersion-in-English program.

Fortunately, there are practical and valuable
alternatives now available to help these children
transform a potentially subtractive form of bilingual
development to an additive one, with advantages
similar to those enjoyed by the mainstream Anglophone
in normal immersion programs. These transformations
are not based on typical "bilingual education" models,
but rather on a dual track education model that
emphasizes the use of the non-English home language
as the major instructional language in the early grades
and, when the time is right, introduces a separate
English-language instructional component when it is
certain that the child's home language has taken root
and is a secure base for adding the new language. This
stage may not be reached until grade two or three. This
alternative brings two traditionally disparate social
groupings - Anglophone mainstreamers and
ethnolinguistic minorities - together around an
exciting new form of bilingualism and biculturalism.

For some, especially the older generations, such an
emphasis on bilingual or multilingual/multicultural
development may seem un-American or socially
dysfunctional and impractical. For others, especially the
young in spirit, it may [illegible] a new coping style
that is taking shape in contemporary
North America, where sizeable subgroups of young
people are not only inquisitive about the rich
multiethnic societies they find themselves in, whether
in Canada or the USA, but also anxious to become
actively involved in this ethnic richness.

Introduction in the First Person

As my hand pushes this pen, and my hand puts
movements into words, and the words are developed
into concepts and images, my values are in most ways
written down. These thoughts can be transformed into
images and are legitimated by some sort of logical
process. My universe of knowledge is applied by
language, thought and action because what I do is
"truth" that is known to me. But I am not alone on this
earth, and I must live with other human beings whose
images are similar to my own. Many times we, as
historical living creatures, must get together and agree
on our value systems in an organized way. Simply, and
accepting all the complexities involved, we can call this
union of agreement the institutions of our society.
These entities cannot exist without human beings,
human beings with feelings, human beings with
collective histories, human beings with individual
biographies, human beings with certain traditions. With
these common agreements of what is and what should
be, we as part of the human endeavor must rationalize
our actions by creating a norm. Our logic seems correct
and when drastic change is implied, change that may
destroy our universe of thought, we resist. If it is change
that "fits" somewhere into our existing schema, if it is
change that can be legitimated in our domain, we
accept it, sometimes with reluctance, and then we
develop a "new" norm or universe of knowledge, but we
forget one thing: We are creatures tied to a history, a
language system, a social milieu, a culture which we
cannot deny. But what happens if one possesses two
cultures and one knows that the two values may be
images associated with the same word? Let us explore
some of these issues in what I call the "Image
Production in Two Languages: Its Implications in
Testing the Bilingual Child."

For a decade I have talked about the concept of image
production and language use to my colleagues, to my
students, and to my friends with no real satisfactory
in-depth determination of what my field (curriculum
theory-bilingual education) had to offer as it related to
knowledge-learning, knowledge-knowing, and thought.
I knew that eventually a more adequate conceptual
schema defining the relationship of language use and
image production would be developed, but until this
happened, I continued to read in the social sciences for
a satisfactory answer. It did not come.



As a curriculum theorist with an interest in
phenomenology as it relates to language education, I have
always been interested in how society distributes
knowledge and how knowledge is valued or prescribed.
Reading Ponte, Schutz, Heidegger, Camus, Sartre,
Wittgenstein, and others gave me a clue of what
knowledge is and how it relates to how one functions in
the world through language and thought, but I was
never satisfied with the personal dilemma of "talking
about" image production in more than one language.

Linguistics helped me to "look at" the structure of
language; sociolinguistics attended to the
interpretation of language and the use of it in social situations;
Berger and Luckmann (1967) in The Social Construction
of Reality helped me to interpret the distribution of
knowledge from the viewpoint of sociology;
psychology attempted to describe and analyze from both a
descriptive and statistical paradigm of language and
behavior; anthropology looked at the nature of
humanity and how it functions in different societies;
and yet, this corpus of knowledge, as well as knowledge
from the other social disciplines, failed to satisfy a
definition of image production in more than one
language.

Eleven years ago, while working at Hunter College of the
City University of New York, in the Department of
Curriculum and Teaching, I met a faculty member who
was in a field called bilingual education. Although I was
interested in cross-cultural studies, I read superficially
in bilingual education. At that time, I was interested in
culture and valued knowledge and I thought that the
field of bilingual education could be seen as another
perspective in cross-cultural studies in the bilingual
situation. I had never directly dealt with a teaching
concept which assumed that the social environment had to be
viewed from a multilingual/multicultural perspective. I
pondered the idea with some interest and finally
accepted the challenge. At about the same time, I was
invited to speak at a Portuguese conference in
Massachusetts about Portuguese children who had
entered a new culture. At that conference, I met a future
friend who was interested in linguistics, especially in
the area of language and thought in bilingual situations.
This second encounter transformed my interest in
bilingual education into a serious inquiry as I started to
realize that this area of study had something new to
offer in the curriculum field. What it was, I did not know,
but I had an urge to find out. Finally, I immersed myself
in the literature. I learned that some of it was sheer
nonsense, but some of it opened the doors to a new
paradigm; a paradigm that, for me at least, could
possibly describe the relationship between language
and thought in the domain of the distribution of
knowledge, especially in the area of image production
and testing.

One day in my office, I was reflecting about language
and culture, and I started to compare my cognitive
systems in such a way that I could view the world or
valued knowledge(s) from two different perspectives; I
was feeling through cognition and language meaning
and how it shifted from one culture to another. To say
this was my first experience would be false. Most
bilingual-bicultural individuals know this phenomenon
by early adulthood, but it was the first time in my life
that I knew that I was onto something new. For me, at
least, I could now seriously think out (or think through)
the phenomenon of being bilingual. The encounter made
me realize that it was possible to have different images
for the same words in different languages. I realized that
norms were based on a collective agreement of a
specific distribution of knowledge legitimated in a
statistical paradigm. Language and thought could be
combined into images, images that represent collective
agreement of what is a "correct" response.

The purpose of this presentation is then to explore the
issue of language and thought through image
production in testing the bilingual child. For this paper,
image production is defined as verbal expressions, or
explanations, that are used to describe or explain a
specific concept of behavior. In addition, using the
concept of image production and the concept of the
compound/coordinate paradigm, and combining these
notions in the domain of language usage
and (cultural) behavior, this paper will attempt to
explore assessment and testing issues in the field of
bilingual education. The following assumptions will be
analyzed for the purpose of investigation:

1) A one-to-one correspondence in language
   translation does not always take place,
   especially when one uses images for the
   purpose of test construction.

2) Language assessment in a particular language
   may necessitate the "proper" images that are
   socially constructed from a particular language
   group or social class.

3) Image production may be a new way of
   analyzing language, bilingualism, assessment
   and testing.

4) The Hidden Curriculum and Political Ideology.

As noted in the introduction, the methodology for this
paper will be borrowed from the reconceptualist
movement in curriculum theory, where most of the
paper is written in the first person. This allows the
author/researcher to explore and combine the existing
data in research with a personal or biased point of view.
This methodology also allows one to explore the
affective, as well as the cognitive, domain in the field of
language assessment. The theoretical framework of the
chapter will be from the point of view that all language
and knowledge is socially constructed. I will not
attempt to dwell on the political aspects of language
policy and testing in the schools. However, many of my
examples may be interpreted as a political expression.
Also, I will not attempt to dwell on ethnicity, but I
cannot "lock myself out" of this process of thinking
when one describes language, images through
language, and the relationship of language and cultural
behavior or cultural reproduction. What I will attempt to
do is to "flow through and around" possible ways of
explaining what it is to be a "balanced" bilingual and
how this phenomenon could be used to analyze image
production and knowledge use in testing the bilingual
child.

What is important is to discuss the concept of valued
knowledge and how valued knowledge affects the
individual when he/she is aware that one concept used
in two different language systems is valued (felt) in
fundamentally different ways. Let us now explore some of
these issues from the point of view that image
production may be a new way of developing a paradigm
for further research in assessment and test construction
for the bilingual child.

A One-to-One Correspondence in Language Translation
Does Not Always Take Place Especially When One Uses
Images for the Purpose of Translating.

Apple and King (1979) hint at the problem of valued
knowledge through the definition of the hidden
curriculum, but they limit their hypothesis to language
control in a monolingual situation vis-a-vis social class,
a fact of which many bilingual educators are well
aware. They state: "Just as there is a social distribution
of cultural capitals in society, so there is a social
distribution of knowledge within classrooms. For
example, different 'kinds' of students get different
'kinds' of knowledge." For the curriculum theorist who is
in the bilingual field, the above assumption is a
self-evident political reality. It is known that knowledge
and how it is used relates directly to pedagogical
principles in a multilingual, multicultural school
experience. In the field of bilingual education and in
testing the bilingual child, the problem is the
distribution of knowledge of the two cultures or two
cognitive styles and how the educator can treat
different value structures in an educational setting.
Curriculum theorists must confront this issue so they
can develop a viable resolution in the area of
pedagogical methods and the area of two languages in
the classroom, i.e., the relationship of language and the
social distribution of language.

Why is the above discussion important? If the
curriculum theorist examines schooling from a
monocultural point of view, based on the premise that
language distribution exists only in one social context,
she/he is in a poor methodological and epistemological
position to examine the meaning of how one can move
in, out, and around, under or over a concept as it relates
to multidimensional meaning in a social context. The
assumption here is that all knowledge is socially
constructed and, hence, all knowledge is valued in
society. This also includes the physical sciences. Kuhn
(1962) presents this problem through paradigmatic
construction in the so-called "hard sciences." His basic
contention is that all "scientific knowledge" is
developed through social paradigms vis-a-vis language.
If this line of thought is accepted, then one may be able
to compare values from different phenomenolingual
paradigms in the distribution of the social sciences.



Bilingual educators may have the bicognitive "tools" to
analyze and implement interrelationships between and
among the social and physical sciences. I realize that
one may argue the point that one has to be
knowledgeable in each discipline to take on such a task,
which is impossible. Yet, one can still be a
generalist in the curriculum field and develop several
models relating to testing in a multilingual and
multicultural situation.

The question basically is the following: If a person is a
"balanced" bilingual and, therefore, "bicultural," does
language shift involve a different social view of the
world? For example, as a balanced bilingual possessing
native ability in two languages, I can conceptualize and
commiserate with the many-faceted functions of two
different types of "family." When I use the term "family"
in English, possibly in an industrial context, the term
implies specific values that may or may not fit the
Portuguese rural term "familia." In English, family is
usually synonymous with the notion of nuclear family,
i.e., the inclusion of only parents and siblings. In this
technological-industrial context, I can extend this
concept to a political level where family may be defined
as the single-parent family, married people without
children, the homosexual family, and other concepts
that do not fit the Portuguese concept of "familia." Other
valued aspects of "family" are the divisions of labor
between men and women, the use and respect of one's
private psychological space, and the sharing of
decisions in terms of money, buying a home, the
relationships with the children, and so on. The nuclear
family values independent behavior from other people,
especially when such behavior involves discussing
personal and financial problems with people outside the
clan.

The Portuguese term "familia," however, recontextualizes
my value image system. My images and
feelings about the "familia" now focus on an extended
relationship. Although my immediate family is still a
spouse and children, my parents are also considered a
part of the "familia." My aunts, cousins, and godparents
are seen as part of the "familia." Decision making does
not necessarily belong to the father, but to the person
who is "older and wise" and who is more influential in
the clan. Children's roles are strictly defined, and there
is rarely a distinction among childhood, adolescence,
and adulthood. You are either a child or an adult.
Women and men have different roles as to the division
of labor in and outside of the home. "Para ajudar a
familia" (to help the family) is such a strong commitment
to the clan that any family member who fails to
take on responsibility for helping out in a crisis is looked
down upon. Another concept that may seem simple to
analyze until examined further is that of the
automobile. When I say "carro" in Portuguese, my
thought processes or cognitive style perceive Fiats,
VWs, Toyotas, etc.; but when I shift my language to
English, I automatically think of Fords, Chevrolets,
Plymouths, etc. Further, if I think of luxury cars in both
languages, something similar happens. In Portuguese,
Mercedes-Benzes and BMWs become a part of my
social-cognitive style; and in English, Oldsmobile Model
98s and Buick Electra 225s become a part of my
phenomena of everyday life as it relates to luxury
automobiles.1

The phenomenon of operating in two cultural cognitive
styles makes me aware of the multitude of cultural
variables that affect my behavior. When I attempt to
apply these givens in the pedagogical field, I must think
about the images in valuing the two different
environments. Another example of bicognition may be
seen in participation in a New Year's gathering. In the
Portuguese working-class society, one way to
participate in the ritual of leaving the old year behind is
to attend a local social club. I am expected to bring my
children along so the community may celebrate
together by dancing and talking about the past year.
The important aspect is that children do participate in
all events. In contrast, my Anglo friends view this ritual
in a different way. Usually, the children are not allowed
to be part of this event, especially if the group decides
to go out for dinner and dancing.

Whatever the psychological and sociological analysis is
for the valued behavior, it is not relevant for this paper.
What is important is that I can "know" and "feel" the
two events as different ways of viewing the world.

A final example, of many, is that of couples walking
down the street together. For the last several years, I
have observed language use and how language
functions in a social encounter within cultural norms.
For example, when American English-speaking
monolinguals socialize with another couple, the way
they walk together gives some interesting clues to
language and culture. In this case, most American
English speakers walking down the street together as
couples will converse as a foursome. Very rarely will the
males and females segregate themselves in a
one-to-one conversation, male-female, but the dialogue
will tend to center itself back to the group. When two
Portuguese couples get together and walk down the
street, the phenomena of social relationships and social
talk generally will change. In most cases, the males will
talk to each other and the females will do the same.
What is more fascinating is that usually the males will
walk in front of the females and they will segregate the
conversation. Although this behavior has many political
and sexist implications, the reality is that language use
and social relationships have a direct involvement in
the reality of everyday life.

As Berger and Luckmann state:

"Language forces me into patterns ...
Language provides me with a ready-made
possibility for the ongoing objectification of my
unfolding experience. Put differently, language
is pliantly expansive so as to allow me to
objectify a great variety of experiences coming
my way in the course of my life. Language also
typifies experiences, allowing me to subsume
them under broad categories in terms of which
they have meaning not only to myself but also
to my fellowmen." (1967)

1 Since the gasoline crisis, there has been a shift in my own value distribution of knowledge.

This analysis, although limited in scope, provides the
curriculum theorist with a clue on how image
production, assessment, and test construction via the
role of language relate to the social environment.
When two or more languages are used, it is possible to
contrast and "flow through" the use of two or more sets
of valued knowledge. What is more important is that
the curriculum theorist cannot talk about knowledge
and learning until she/he is able to specify what is
considered "worthwhile" in the context of a specific
culture.

Using the above examples, one may note that separate
image production is closely related to coordinate
bilingualism rather than compound bilingualism
(Weinreich, 1953; Ervin and Osgood, 1954; Lambert,
1958). Lambert (1958) popularized the
compound-coordinate paradigm through the use of the
mediation theory of semantics. If the person is a coordinate
bilingual, his/her mediating semantic responses will
differ according to the language of the
SIGN. Likewise, if the person is a compound bilingual,
the mediating semantic responses will be the same
regardless of the language of the SIGN. For
coordinates, the corresponding terms in the two
languages each signify a separate "semantene." For
compound bilinguals, corresponding terms
signify a single semantene regardless of the language
itself (Martinez-Boyd).

Image production becomes a useful tool in testing
situations to reproduce items that are both valid and
reliable. If one is in a coordinate situation where
differing situations exist, the images of each sign
(usually called cultural variables) may be identified for
further test construction. If only one sign is in use (a
compound situation), the following variables may be
explored for further analysis: 1) the specific sign is not
known and an item may be developed in a test for
discrimination; 2) the item (image) becomes a problem
of cultural capital or distribution of knowledge where
the bell-shaped curve may bias a specific social class. In
either case, compound or coordinate, image production
may be the avenue for further test production.

Language Assessment in a Particular Language May
Necessitate the Proper Images That are Socially
Constructed From a Particular Language Group.

The question is then, what is the proper image that may
be reproduced for test or assessment procedure? It
seems that image production is directly related to
language production and valued knowledge of a
specific culture. That is, each culture has its own
collective agreement of what is "correct" and what are
the "proper" responses for individuals in an assessment
or testing situation. My examples of the family/familia
or code switching in Portuguese and in English are
perfect illustrations of different but "correct" images of
two different cultural behaviors.

What this means is that educators must approach
testing and image production by two sets of factors, or
what I term system and context. In the system
approach, the theorist will apply general theories of
learning or general theories of behavior to a specific
situation. For example, one may select and develop a
curriculum around programmed instruction. Then the
theorist must apply the CONTEXT (cultural variables)
that will fit the valued knowledge within the specific
culture (Simões, 1979). Another model, of many, would
be the use of Freud's exploration of human behavior
vis-a-vis the superego, ego, and the id. Here the system
is the conflict between the superego and the id which
results in an outward behavior called the ego. Applying
this system to context, however, is crucial for positive
and negative learning prescriptions. Again, when two or
more contexts are functioning within one logical
system, the theorist has different ways of analyzing a
paradigm. Another way to look at this issue is offered by
Christian, who states that the child develops, between
the "I" and the "me," a system of communication that is
normally monolingual, representing a relatively coherent
and consistent system of concepts, but this situation
may change rapidly when the "me" starts speaking
another language. This change implies a new set of
social rules for the "I," a new and different type of
conversation between the "I" and the "me," a new
self-concept for the person (Simões, 1976). This implies
that educational theorists must be knowledgeable
about the differences of image production and how this
process is directly related to knowledge use and
knowledge learning. When the theorists can be aware of
and know two or more contexts within one system,
then there is a possibility of understanding the nature of
assessment in testing the bilingual child. The field of
bilingual education may supply new information in this
domain.

Image Production May be a New Way of Analyzing 
Language, Bilingualism, Assessment and Testing. 

As noted, being a balanced bilingual gives the viewer of
the world two structured valued knowledge systems
and the feeling of what is different in two languages. To
"feel different words," to "feel different domains of
language," to "feel different social constructions of
reality,"2 to "feel different senses of touch" as they
relate to cultural forms, to "feel or see different
perspectives of the world," to "feel different forms of
mathematics as the world is divided through language
and signs vis-a-vis culture," and so on, gives the
bilingual/bicultural person a way of analyzing the world
through a feeling of relativity and how knowledge is
based on valued assumptions.

2 I have used the term "reality" in the singular but used the term "constructions" in the plural. Either use of these terms could be analyzed from different perspectives.

Now I ask: Can a monolingual person view the world
from different perspectives and analyze and construct a
world view from the point of view of interdisciplinary
studies? Contrary to what one might expect, the answer
seems to be affirmative. There are many ways to view
language. Even within one language there are
varied styles and dialects, and these dialects are usually
divided along social class lines. A word in one social
context may have a different meaning in another social
context. Unfortunately, most educational institutions
promote only one type of language system, called
standard language, and other dialects are usually seen
as an inferior language system which is frowned upon.
Valued knowledge, language, and the interrelationship
of both domains cannot be talked about or be
implemented as interrelated phenomena unless specific
interrelationships are known by the theorist, be it from
one language or phenomenon borrowed from one social
class to another. In other words, language cannot
divorce itself from the phenomena of reality or everyday
life, and different forms of language, be they dialects or
not, cannot be subjected to a single set of criteria
in pedagogical situations.

What becomes a more important problem is the
concept of compound bilingualism in the area of
testing, assessment and image production. I do realize
that there are problems with the use of the
compound/coordinate distinction and that there is
extensive literature since the early days of Weinreich
(1953), Ervin and Osgood (1954), Lambert (1958), and
MacNamara (1970). One also has to realize that there
are other schools of thought regarding language in a
one-to-one correspondence or the "mixed language
theory" (Huerta, 1977; Chimombo, 1978; Martinez-Boyd,
1978). One may use the concept of "additive"
versus "subtractive" environments (Cummins, 1976) as
a reference for the coordinate and compound
distinction. The literature is rich indeed on how bilinguals
perform on certain tasks in a continuum of language
development (for example: Sienkiewicz, 1974; Simões,
1976; Hoffmann and Ariza, 1978; Redlinger, 1979;
Ben-Zeev, 1977; and Doyle, 1977). The problem in our
research data base is that we still do not know where
the semantic system changes to different signs for the
same concept or test item. Hence, there really is no
distinction between a compound bilingual and a
coordinate bilingual. It is really a continuum in language
development in that "bilingualism" is only produced
when two or more images are produced by the same
sign. It is in this context that image production
becomes a viable methodology for further research in
language assessment and test construction. Language
is in some aspects an acquisition of knowledge by a
specific social class. Language does distribute
knowledge and images or behavior. When two or more
languages are acquired, the same principle still remains
in effect. That is, language(s) are produced by a social
distribution of knowledge.



The Hidden Curriculum and Political Ideology 

Before closing, I think it is important to make reference
to what I term the hidden curriculum and political
ideology with reference to language use. It is a known
reality that schools still seek to maintain a specific
cultural capital through each social class.

To attempt to impose a language on students who do
not speak it or to impose a specific valued knowledge
system implicitly violates the civil rights of every child
who is linguistically or socially different. The so-called
accepted knowledge norm of the educational
establishment may be hazardous to the children's right
to fully participate in an integrated society. Haugen
develops this theme when he talks about language. He
states:

The concept of 'norm' in reference to language
is highly ambiguous and slippery. It may refer
to a standardized language like French, codified
in grammars and sanctified by an Academy,
taught in schools, and written by authors, but
spoken by no one, except under duress. Any
deviation from such a norm is deemed to reveal
one's lack of a proper education, and is
regarded as barbarism if it is unintentional. But
it may be acceptable if it is an intentional
stylistic variation, either as a mockery of the
lower classes or as a relaxation of standards ...
(1977)

Unfortunately, most curriculum theorists, if not all, 
cannot divorce themselves from political ideology and 
hence it is sometimes difficult to separate an 
educational or a language analysis from political 
thought. The use of image production in the analysis of 
bilingualism and biculturalism may suggest a new 
restructuring of political ideology as it relates to 
curriculum theory. It suggests that no curriculum
theorist can "objectify" an educational process without 
knowing the cultural variables of the learner. 

I believe that it is legitimate to state that the schools
have locked out cultural differences, which, of course,
are language and dialectal differences in the racial
and language minority communities. Educators have
not, until recently, looked at this process as a
pedagogical force that may create new ways of looking
at cognitive differences and how these differences
relate to knowing more than one way of viewing the
world. When I hear the Black community stating that
most Blacks know how to "play the White man's game,"
or when I hear a bilingual state that she/he can move in
and out of one culture to another, it suggests to me that
many social science skills have been developed by the
voices of minority groups. More importantly, it strongly
suggests that curriculum makers have not seriously
looked at cultural differences as a possible way to
shape school programs around meaningful experiences
for children. What has happened is that a political
model has superseded the pedagogical/language
model and, consequently, most schools were based on
a mono-social-class paradigm that fits only the "selected
few" in our society.

THE PARADOX OF THE PROPER NORM FOR ESL EVALUATION

Abstract

The question of what are appropriate norm groups for
English language proficiency tests used to assess
English skills growth is critical. Because all students 
who are identified as limited in English proficiencies 
must be served by programs, control 'groups from 
whom treatment is withheld are usually not available. 
Typically, the test norms are used as proxy comparison
groups to circumvent these problems. However,
instruments of this type are generally normed on
monolingual or mainstream English populations, who
may not be appropriate standards to measure pretest to
posttest growth. They have had greater exposure to
English at the time of pretest. To achieve the same
score, their rate of skills acquisition must therefore be
different. A possible solution may be to norm these
tests on limited-English-proficient students. However,
since these students must receive supplemental
English intervention, they cannot constitute an
expectation of how much students would have gained
without the program. A better solution is derived from
the concept of discriminant validity. A totally
discriminant test completely separates limited-English
and English-proficient students, while a nondiscriminant
test fails to separate these populations. Analyses
of these two extremes suggest strategies for using
English norms developed from mainstream students as
proxy evaluation comparison groups. Examples of these
strategies and data generated from their use in New
Jersey studies are presented.



Overview

In New Jersey, English language proficiency is defined
in normative terms. Students who speak other
languages at home are placed into English as a Second
Language (ESL) or bilingual education programs which
have ESL components if their English language skills
are poorer than those of their grade peers. This
judgment is made by assessing student performance on
placement tests normed on monolingual English
students. Because ESL and full bilingual education
programs receive state funds, they must be evaluated.
All limited-English-proficient (LEP) students must be
served by these programs. Therefore, evaluation
designs cannot use control or comparison groups in the
usual sense because treatment may not be withheld
from any students with similar needs.

The instrument programs use for placement should be
sensitive to differences in English skills and also to
changes in students' English proficiencies. It might also
serve to evaluate the impact of state programs on
English skills by using the normative sample as a proxy
comparison group. Such a solution would greatly
economize testing in bilingual education programs.

This solution is similar to that employed by the Title I
(now Chapter 1) Model A1 (Tallmadge and Wood,
1976). As in bilingual/ESL programs, all students with
identified needs receive treatment. Pretests and
posttests are analyzed in norm-referenced scores,
usually Normal Curve Equivalents (NCEs). When
students score the same NCE on an April posttest as
they had on an October pretest, they have gained
approximately as much as students in the test norms
over these six months. Increases in NCEs mean that
program students have gained more during some
specified period (usually a school year) than might have
been expected from the scores of students in the
normative samples (Tallmadge, 1976).
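
The NCE arithmetic behind this model can be sketched briefly. The following is a minimal illustration only, not a procedure taken from Model A1 itself. It assumes the usual definition of the NCE as a normalized score with a mean of 50 and a standard deviation of 21.06, so that NCEs of 1, 50, and 99 coincide with the same percentile ranks. The function names and the sample percentile ranks are hypothetical.

```python
from statistics import NormalDist

NCE_MEAN = 50.0
NCE_SD = 21.06  # conventional NCE standard deviation (assumed here)


def percentile_to_nce(percentile_rank: float) -> float:
    """Convert a national percentile rank (strictly between 0 and 100) to an NCE."""
    z = NormalDist().inv_cdf(percentile_rank / 100.0)
    return NCE_MEAN + NCE_SD * z


def nce_gain(pretest_percentile: float, posttest_percentile: float) -> float:
    """NCE gain from pretest to posttest; zero means growth equal to the norm group's."""
    return percentile_to_nce(posttest_percentile) - percentile_to_nce(pretest_percentile)


# Hypothetical student: 20th percentile in October, 30th percentile in April.
same_standing = nce_gain(20, 20)   # no change relative to the norm group
improved_standing = nce_gain(20, 30)  # positive: gained more than the norm group
```

A zero gain does not mean that nothing was learned; it means the student kept pace with the normative sample over the interval.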

On the surface, this solution offers bilingual and ESL
programs a means of using test norms as proxy
comparison groups. Students could be administered an
instrument normed on monolingual English students.
Their NCE scores could be used both as one criterion for
program entry* and as a pretest for program evaluation.
Such a model certainly economizes the testing program.

Unfortunately, this model assumes that growth in
English skills made by the monolingual English
normative sample is a reasonable expectation of growth
for limited-English-proficient students.

For example, third grade Title I students are probably
homogeneous in both Title I and school experience.
Students in the test's norm samples for that grade are
similar in school experience and have had exposure
similar to that of the Title I students to the area being
measured. Bilingual education classes are not as
homogeneous. Moreover, their exposure to the skill
being measured by an English proficiency test is
different from the English exposure of the normative
sample.

Clearly, use of monolingual English norms for 
evaluation purposes is not a simple issue. This paper 
explores various applications of these norms, 
considerations that must be made in their use and some 
alternate approaches to the evaluation issue. The first 
question a test user must ask is whether one instrument 
may be valid for placement and evaluation. 

Validity of Measures 

In this instance, the construct being measured is
English proficiency in the academic setting. Placement
and evaluation are related because both are concerned
with demonstrating differences in proficiency.

* If it were the sole criterion, it could not be used as a pretest. Student scores might regress to the mean and show spurious gains (see Tallmadge and Wood, 1976).

Tests which serve both functions should be sensitive at
and below the cutoff (scoring criterion for program
eligibility). This is ensured during the item selection
process. Less difficult items increase the test's
sensitivity in this range. One means of determining if
the test may serve both functions is reviewing item
p-values (the proportion of students answering each item
correctly) to determine whether the items are
inordinately difficult for LEP students.
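
A review of item p-values of the kind described above might be sketched as follows. This is an illustration under stated assumptions, not the procedure used in the New Jersey studies: the response matrix is invented, the function names are hypothetical, and the floor of 0.20 below which an item is called "inordinately difficult" is an arbitrary choice made for the example.

```python
def item_p_values(responses):
    """Return the proportion of students answering each item correctly.

    responses: one list per student, each holding 0/1 scores for every item.
    """
    n_students = len(responses)
    n_items = len(responses[0])
    return [
        sum(student[item] for student in responses) / n_students
        for item in range(n_items)
    ]


def flag_hard_items(p_values, floor=0.20):
    """Return the indexes of items whose p-value falls below the floor."""
    return [index for index, p in enumerate(p_values) if p < floor]


# Hypothetical scores of five LEP students on a four-item test (1 = correct).
lep_responses = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 0, 0],
]

lep_p_values = item_p_values(lep_responses)
too_hard = flag_hard_items(lep_p_values)
```

Items flagged this way contribute little to sensitivity at and below the cutoff, since nearly all LEP students miss them.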



Evaluation Test Validity. Validity of evaluation
instruments depends on their sensitivity to the course
curriculum. This is assessed by matching test
objectives to course objectives.

The objective of ESL programs, as part of bilingual
education or as independent programs, is to bring the
English skills of participants to the levels of
English-proficient age peers. Evaluation instruments
should reflect initial deficits and increases in English
skills.



Validity and Cutoffs. Validity in placement instruments
is determined by how well the cutoff is set. As it
changes, the validity coefficient varies.

Grehan (1974) examines this relationship. He defines
the validity of a mastery test as its ability to make
correct decisions about students. In proficiency testing,
the instrument should place LEP students into
bilingual/ESL programs and English-proficient students
into the regular school curriculum. If two samples, LEP
and English-proficient students, are tested, then validity
is operationalized as the ratio of the number of students
correctly placed to the total number of students tested.
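The ratio just described can be sketched in a few lines of Python. This is only an illustration: the function name, the placement rule (scores below the cutoff go to the bilingual/ESL program) and the scores themselves are assumptions made for the example, not data from any study cited here.

```python
def placement_validity(lep_scores, proficient_scores, cutoff):
    """Proportion of tested students whom the cutoff places correctly.

    Assumed rule (hypothetical): a score below the cutoff places a
    student in the bilingual/ESL program; a score at or above it
    places the student in the regular curriculum.
    """
    correct_lep = sum(1 for s in lep_scores if s < cutoff)
    correct_proficient = sum(1 for s in proficient_scores if s >= cutoff)
    total = len(lep_scores) + len(proficient_scores)
    return (correct_lep + correct_proficient) / total


# Invented raw scores for two small samples.
lep = [12, 15, 18, 22, 25, 31]
proficient = [24, 30, 33, 35, 38, 40]

print(placement_validity(lep, proficient, cutoff=26))
```

Moving the cutoff up or down changes which students are misplaced, which is why the coefficient varies with the cutoff.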

Hambleton and Eignor (1980) provide two means of
setting cutoffs which should maximize test validity. In
the contrasting group method, plots of monolingual and
LEP students' scores are made. The intersection of the
scores is the first estimate of the cutoff. It is adjusted
upward or downward to maximize the validity
coefficient defined above.
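The adjustment step can be illustrated with a brute-force search. The sketch below is a simplification, not the published procedure: instead of starting from the plotted intersection, it tries every observed score as a cutoff and keeps the one with the highest proportion of correct placements. All names and scores are hypothetical.

```python
def proportion_correct(lep, proficient, cutoff):
    """Share of students placed correctly when scores below the
    cutoff are treated as LEP (assumed rule)."""
    hits = sum(s < cutoff for s in lep) + sum(s >= cutoff for s in proficient)
    return hits / (len(lep) + len(proficient))


def best_cutoff(lep, proficient):
    """Try each observed score as a cutoff; the lowest score that
    reaches the maximum proportion correct is returned."""
    candidates = sorted(set(lep) | set(proficient))
    return max(candidates, key=lambda c: proportion_correct(lep, proficient, c))


# Invented raw scores for illustration only.
lep = [12, 15, 18, 22, 25, 31]
proficient = [24, 30, 33, 35, 38, 40]

cutoff = best_cutoff(lep, proficient)
print(cutoff, proportion_correct(lep, proficient, cutoff))
```

With overlapping groups several cutoffs can tie, so a program would still need a rule (or judgment) for choosing among them.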

The borderline group method uses teacher judgment to 
identify students who are at the borderline, in terms of 
being academically proficient in English. Their median 
score becomes the cutoff. 

A similar method involves testing LEP and
English-proficient students and analyzing group membership
via discriminant analysis. The highest score for
membership in the LEP group and the lowest score for
membership in the English-proficient group should be
averaged to yield the cutoff.

Ulibarri, Spencer and Rivas (1981) claim that variation
in cutoffs causes disagreement between instruments
on placement decisions. The cutoff determines if a child
is proficient enough to be placed in a monolingual
English classroom. Agreement between the test
decision and the child's proficiency determines validity.
Thus, the instruments vary in validity to the extent that
they vary in how they would place a child.

Thorndike (1982) defines taxonomic validity in a similar
manner. The validity to make placement decisions is the
ratio of variance between groups to total variance. The
square root of this is a point biserial coefficient. The
coefficient is more informative than the proportion of
correct placement decisions because it accounts for
scoring variation on the test within groups.
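The relationship between the variance ratio and the point biserial coefficient can be checked numerically. The following sketch uses invented scores and hypothetical function names; it computes the between-groups share of the total sum of squares and, separately, the usual point biserial formula, so the two can be compared.

```python
from math import sqrt
from statistics import mean, pstdev


def variance_ratio(group0, group1):
    """Between-groups sum of squares divided by total sum of squares."""
    scores = group0 + group1
    grand = mean(scores)
    ss_total = sum((x - grand) ** 2 for x in scores)
    ss_between = (len(group0) * (mean(group0) - grand) ** 2
                  + len(group1) * (mean(group1) - grand) ** 2)
    return ss_between / ss_total


def point_biserial(group0, group1):
    """Point biserial correlation between group membership and score."""
    scores = group0 + group1
    p = len(group1) / len(scores)
    q = 1 - p
    return (mean(group1) - mean(group0)) / pstdev(scores) * sqrt(p * q)


# Invented raw scores for illustration only.
lep = [12, 15, 18, 22, 25, 31]
proficient = [24, 30, 33, 35, 38, 40]

print(sqrt(variance_ratio(lep, proficient)))
print(point_biserial(lep, proficient))
```

Unlike a count of correct placements, both quantities shrink when scores within each group are widely spread, which is the sense in which the coefficient accounts for within-group variation.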



Finally, the validity of both placement and evaluation
instruments depends on their capacity to measure
proficiencies relevant to English classroom
performance. Test items should be embedded in
academic materials, or what Cummins (1981) called
"context-reduced," if they are to predict classroom
proficiencies.



Homogeneity (Unidimensionality). The sensitivity of
both types of instruments to differences in English skills
can be maximized through the item-selection process.
Samples of varying English skills, including LEP
students (Angoff, 1971), should be tested.

Homogeneity of the instrument over the domain is
central to the validity of both types of tests and to the
validity of multiple uses of a test. Unfortunately,
technical manuals rarely contain information related to
this issue. Yet, under the common underlying
proficiency model (Cummins, 1981), the English
language skills of LEP students are predicted by an
underlying linguistic construct (Cummins, 1981). This
construct is manifest in language skills at all ranges of
development. In placement tests, homogeneity allows
comparison of the skills of English-proficient and LEP
students because it assures the instrument is focused
on the proficiency variable. (See Hambleton and Cook,
1977 for a review of this assumption.) In evaluation
instruments, homogeneity guarantees that scoring
differences are related to skills differences.

The test may be shown to be homogenous over the
domain through factor-analytic or other scaling
techniques. This also may be determined if the items
retain their ranks in difficulty over populations of
varying English skills. The hardest items for LEP
students should also be hardest for English-proficient
students. Variation in the difficulty ranks would occur if
the test is measuring different skills in each group.
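One simple way to check whether items keep their difficulty ranks is a rank correlation between the two groups' item p-values. The sketch below uses Spearman's rho as a stand-in for that comparison; the choice of statistic, the function names and the p-values are all assumptions for illustration, and the simple formula used here assumes no tied p-values.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman rank correlation for untied data."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Invented item p-values (proportion answering each item correctly).
lep_p = [0.20, 0.35, 0.50, 0.65, 0.80]
proficient_same_order = [0.60, 0.70, 0.85, 0.90, 0.97]
proficient_reordered = [0.90, 0.70, 0.85, 0.60, 0.97]

print(spearman_rho(lep_p, proficient_same_order))
print(spearman_rho(lep_p, proficient_reordered))
```

A coefficient near 1 is consistent with the items retaining their difficulty ranks across groups; a low coefficient would suggest the test is tapping different skills in each group.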

If the test is not homogenous, it may separate children
on other skills besides proficiency. Moreover, if such a
test measures different variables as students'
proficiencies increase, it will be of little evaluation
utility.


Reliability

Both placement and evaluation tests should be
internally consistent within areas (reading, speaking,
etc.) and over time (test-retest reliability).

Tests that are particularly sensitive to surface skills (see
Dieterich, Freeman and Crandall, 1979 for review) may
show large student improvements over time by virtue of
repeated exposure to their written and oral vocabulary.
Tests like the 1982 edition of the New York Language
Assessment Battery (LAB) offer alternate forms to
avoid this familiarity. When multiple forms are used,
e.g., one as a pretest and another as a posttest, the
instrument should demonstrate alternate-form
reliability. Horizontal (across form) equating is also
needed to permit comparisons across levels.

Equating 

Placement tests are used for program entrance and exit.
During the intervening years, students change grades
and, often, test level. Changes in scores may reflect
variation in the objectives of these levels or in the
abilities of the normative samples. Placement tests
should be equated vertically, across levels.



Use of Norms 

The instrument's ability to interpret English skills may
be increased or reduced by the choice of scoring
metrics. The current LAB (Riverside Publishing
Company, 1976) is normed on monolingual English
students. ESL evaluations using norm-referenced
scores use NCEs or other standard scores which refer to
these students.


Angoff (1971) and others argue that reference to a
particular norms group imposes a meaning on results
that is not only inaccurate at times, but is always
becoming obsolete, since the norms are based on the
distribution of skills at one point in time.

This argument applies to proficiency tests normed on
monolingual English samples. Although the
norm-referenced scores provide a comparative
standard of skills of LEP students to monolingual
English peers, this standard may be dated.

Moreover, test norms may not be as useful for ESL
evaluation as they are for Title I. The Title I model
assumes that, at the time of pretest, the normative
sample for any grade had as much exposure to the skills
being measured as the Title I group. This assumption
permits an expectation for posttest scores to be derived
from the performance of the normative sample. The
expectation is compared to actual student posttest
scores.

This assumption cannot be made in evaluating ESL
performance because at the time of pretest, ESL
students have had less exposure to English than their
monolingual English grade peers. Consequently, a LEP
student scoring the same on the pretest as monolingual
English students in the test norms may well score
higher on the posttest. The LEP student scored as well
on pretests after less exposure. With an equal time of
exposure between pretest and posttest and a structured
curriculum, the LEP student should perform better.

It seems more appropriate to use students with similar
exposure to English to norm evaluation tests. For
example, fall performance of LEP students could be
referred to LEP fall norms, and spring posttest
performance might be compared to spring LEP norms in
each grade.

Unfortunately, this model is controlled too well.
Between the pretest and posttest, LEP students
comprising the norms will have received ESL
instruction (by law). The normative sample cannot serve
as a proxy comparison group because its performance
will not represent "no-intervention" achievement.

This problem can be avoided by norming tests on
students who have not yet had ESL instruction. For
most districts, indeed most states, this would require
large constant flows of immigrants.

Even if this were possible, in states like New Jersey,
such norms would be decidedly unrepresentative of the
current LEP population in sociocultural factors. Angoff's
(1971) comment on norms obsolescence would be
most meaningful if immigration patterns continue to
vary.

Also, the normative sample might not be an appropriate
comparison group for second-year, third-year or more
experienced ESL students. The problem is similar to
that involved in using monolingual English norms. Such
students have had more English exposure at the pretest
than normative sample students. They should be
gaining English more slowly than immigrants who have
scored as well without as much exposure.

The final general consideration regarding the use of
normative samples as proxy comparison groups is
statistical power. LEP students should perform more
poorly than the monolingual normative sample, and the
distribution of their scores should be confined to lower
score ranges. However, unless the distribution is fairly
normal, the statistical tests may lack sensitivity to
student growth or to differences in group abilities.
For example, suppose every LEP student scores lower
than the monolingual English normative group and
receives an NCE of 1. If the lowest ability student scores
as well (in raw scores) on the posttest as the highest
ability student did on the pretest, that student will still
have an NCE of 1 on the posttest. The instrument would
not have been sensitive to the student's growth. Such
scores, in themselves, are meaningless for the teacher.
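The floor effect in this example can be reproduced with the standard NCE definition, NCE = 50 + 21.06z, bounded at 1 and 99. The sketch below assumes, purely for illustration, that the norm group's raw scores are normally distributed with a hypothetical mean of 80 and standard deviation of 8; the function name and the student scores are likewise invented.

```python
def nce_from_raw(raw, norm_mean, norm_sd):
    """Convert a raw score to an NCE against a norm group assumed to
    be normally distributed; NCEs are bounded at 1 and 99."""
    z = (raw - norm_mean) / norm_sd
    nce = 50 + 21.06 * z
    return max(1, min(99, round(nce)))


# Hypothetical monolingual English norm group.
NORM_MEAN, NORM_SD = 80, 8

pretest_raw, posttest_raw = 20, 45   # a 25-point raw gain
print(nce_from_raw(pretest_raw, NORM_MEAN, NORM_SD))
print(nce_from_raw(posttest_raw, NORM_MEAN, NORM_SD))
```

Both scores fall so far below the norm group that they convert to the same bounded NCE, so the raw gain disappears from the norm-referenced record.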



New Jersey Study 

This difficulty is illustrated with scores from 731 New
Jersey bilingual education students, who were
cluster-sampled for a study of English language
acquisition (DeMauro, 1981). They represented state
bilingual education students in socioeconomic status,
program types, region of state and program experience.
Students in grades 1-12 were tested. This sample is
currently in its third year of assessment. (See Appendix
A for a summary of the first year.)

The New York City Language Assessment Battery
(Cumbo, O'Neill, Tilis and Weichyn, 1976) was
administered in October 1979 and again the following spring.



Scores were converted to NCEs based on spring norms
available in the technical manual (Riverside Publishing,
1976) and fall norms available from the New York City
Office of Testing. Both norms were developed from
monolingual English samples.



The score distributions were compared to normal
distributions via goodness of fit tests. Frequencies of
scores were computed in intervals representing one
tenth of the range of scores. Chi-squares were
computed, for both raw scores and NCEs, for each year
of program experience. Large numbers of first and
second graders permitted separate statistical analyses
in these grades. For other grades, the statistics were
computed by levels of the test (grades 3-6 and 7-12),
because fewer students were tested in each grade.
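A goodness of fit statistic of this general kind can be sketched as follows. The code is an approximation under stated assumptions, not a reconstruction of the study's analysis: it fits a normal curve using the sample mean and standard deviation, uses ten equal-width intervals over the observed range, lets the two outer intervals absorb the tails, and does not pool sparse intervals (which the study's smaller degrees of freedom suggest was done). The score lists are invented.

```python
from statistics import NormalDist, mean, stdev


def chi_square_normal_fit(scores, bins=10):
    """Chi-square statistic comparing binned scores with a normal
    curve fitted to the same scores."""
    lo, hi = min(scores), max(scores)
    width = (hi - lo) / bins
    edges = [lo + k * width for k in range(bins)] + [hi]
    fitted = NormalDist(mean(scores), stdev(scores))
    n = len(scores)
    chi2 = 0.0
    for k in range(bins):
        left, right = edges[k], edges[k + 1]
        if k == bins - 1:
            observed = sum(left <= s <= right for s in scores)
        else:
            observed = sum(left <= s < right for s in scores)
        # Outer intervals take in the tails of the fitted curve.
        lower_p = 0.0 if k == 0 else fitted.cdf(left)
        upper_p = 1.0 if k == bins - 1 else fitted.cdf(right)
        expected = n * (upper_p - lower_p)
        chi2 += (observed - expected) ** 2 / expected
    return chi2


def expand(frequencies):
    """Turn a frequency list for scores 0..9 into a list of scores."""
    return [score for score, f in enumerate(frequencies) for _ in range(f)]


# Invented distributions: one roughly bell-shaped, one piled at the floor.
bell = expand([1, 2, 5, 9, 13, 13, 9, 5, 2, 1])
floor_heavy = expand([30, 10, 6, 4, 3, 2, 2, 1, 1, 1])

print(chi_square_normal_fit(bell))
print(chi_square_normal_fit(floor_heavy))
```

The floor-heavy list yields the larger statistic, which is the pattern the NCE distributions show when many students sit at the lowest possible score.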

The results (Tables 1 and 2) reveal positive skews in the
NCE distributions, except for second- and third-year
second graders on the posttest. In these grades, the
distributions are negatively skewed.



Table 1

Distribution of LEP Scores on the LAB
Goodness of Fit Statistics
and Standard Deviations

                   Goodness of Fit               Standard
                   Chi Squares(1)                Deviations
Grade   Year   Pre             Post             Pre       Post

NCE
1       1      (5)71.841***    (5)37.963***     26.845    24.077
        2      (2)19.955**     (5)39.601***     26.000    23.719
2       1                                       15.298    20.364
        2      (4)13.953**     (4)15.920**      26.189    29.266
        3      (4) 9.867*      (4)22.447***     24.092    30.99
3-6     1      (1)16.920***    (2)15.027***      8.939    14.661
        2      (2)45.684***    (4)16.349***     17.134    17.184
        3      (3)20.524***    (3)30.538***     13.838    15.791
7-12    1      (1)47.806***    [illegible]      10.040    10.948
        2      (1)19.616***    (?) 7.277***     13.594    12.699
        3      (1)28.910***    (3)11.900**      14.751    13.879

RAW
1       1      (3) 5.650       (4)13.538**       8.060     8.217
        2      (2) 1.133       (4) 0.586         6.291     6.525
2       1                                        8.094     6.297
        2      (4)21.220***    (2)30.397***      8.475     5.640
        3      (4)86.321***    (1)31.920***      5.708     4.711
3-6     1      (2) 8.218*      (3) 6.282        17.352    17.340
        2      (3) 6.635       (3) 8.975*       19.207    17.505
        3      (4) 1.051       (3)10.031*       17.961    16.703
7-12    1      (3) 4.714       (3) 3.598        15.688    15.045
        2      (3) 4.287       (2) 1.766        15.039    13.657
        3      (2) 1.455       (2) 1.787        13.568    13.333

(1) Degrees of freedom are in parentheses.

  *p < .05
 **p < .01
***p < .001
Table 2

Percentages of LEP Students
Scoring Lowest (Floor) and Highest (Ceiling)
Possible Scores on LAB

                 Percentage at Floor       Percentage at Ceiling
                 NCE          RAW          NCE          RAW
Grade   Year     Scores       Scores       Scores       Scores

Pretest
1       1        [illegible]   3.03         1.52         1.52
        2        24.66         1.37         1.37         1.37
2       1        [illegible]   4.55         9.09         9.09
        2        25.00         1.92         5.77         5.77
        3        12.00         2.00         6.00         6.00
3-6     1        43.75         3.13         6.67         6.67
        2        27.85         1.27         1.27         1.27
        3        16.82         0.93         0.93         0.93
7-12    1        [illegible]   1.30         1.30         1.30
        2        [illegible]   1.61         1.61         1.61
        3        30.99         1.41         1.41         1.41

Posttest
1       1        24.24         1.52         [illegible]  4.55
        2        [illegible]   1.37         2.74         2.74
2       1        22.73         9.09         4.55         4.55
        2         7.69         1.92        15.39        17.31
        3         2.00         2.00        28.00        28.00
3-6     1        28.56         1.56         3.13         3.13
        2        13.92         1.27         1.27         1.27
        3         7.48         0.93         0.93         1.87
7-12    1        45.45         1.30         1.30         1.30
        2        32.26         1.61         1.61         1.61
        3        19.72         1.41         1.41         2.82


In fact, although there were too few students (22) to
test the distribution, 22.7 percent of first-year second
graders scored the two highest posttest scores. Among
third-year second graders, 52 percent scored at least 39
on the 40-item test. Second graders' scores probably
reflect the ease of this test level, which must also be
sensitive to reading and writing skills of kindergarteners.

In a test that totally discriminates LEP from
English-proficient students, all entering (first-year) LEP
students would score an NCE of 1 because the highest
LEP students would score lower than the lowest
monolingual English student. This is an outcome of a
very skilled norms group, rather than of an overly
difficult test. An overly difficult test would result in
positively skewed raw scores, but the raw scores of
bilingual education students on the LAB are normally
distributed, while the NCE scores are not. If the variation
in raw scores of LEP students is meaningful in terms of
abilities, then the conversion to NCEs sacrifices this
information.

In the case of a totally discriminant test, a linear
transformation of the raw scores based on LEP student
scores would provide the evaluator with more
information concerning student gains. With such a
transformation, a totally discriminant test would
separate those LEP students whose scores were most
similar to English-proficient students. Providing the
instrument were measuring the same trait across ability
levels, this transformation would enable programs to
use the instrument for placement and evaluation
purposes.
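A linear transformation of this kind might look like the sketch below. It rescales raw scores against a LEP reference distribution rather than the monolingual norms. The target center of 50 and spread of 21.06 are assumptions chosen only to resemble the NCE metric, and the function name and scores are invented.

```python
from statistics import mean, pstdev


def lep_referenced_scale(raw_scores, reference_scores, center=50.0, spread=21.06):
    """Linearly rescale raw scores using the mean and standard
    deviation of a LEP reference group (hypothetical scale)."""
    m = mean(reference_scores)
    sd = pstdev(reference_scores)
    return [center + spread * (x - m) / sd for x in raw_scores]


# Invented raw scores for five LEP students; the pretest serves as
# the reference distribution.
pretest = [10, 14, 18, 22, 26]
posttest = [16, 19, 25, 27, 33]

pre_scaled = lep_referenced_scale(pretest, pretest)
post_scaled = lep_referenced_scale(posttest, pretest)
for before, after in zip(pre_scaled, post_scaled):
    print(round(before, 1), round(after, 1))
```

Because the transformation is linear, every raw gain is preserved on the new scale, whereas the same students could all sit at an NCE of 1 against monolingual norms.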

This condition would be met if the orders of item
difficulties were the same for LEP and English-proficient
students. If the order varies with increasing
proficiencies, the test is not measuring the common
underlying proficiency. Separate analyses would be
necessary for various proficiency levels, and the
instrument would be of questionable utility for
evaluation.

Even more homogenous tests may be better analyzed
by blocking (partitioning) groups according to program
experience. As Myers (1972) points out, such a design:
(a) makes groups more homogenous regarding a
variable that affects scoring; (b) yields meaningful
interactions (as we shall see); and (c) is more efficient
than a one-factor design. Cummins (1979) agrees and
argues that ignored (not partitioned) sociocultural and
linguistic factors often reduce the power of evaluations
of bilingual education.

Within a grade (or test level, if the student sample is
small) and language group, three LEP groups might be
designated. The first has one year or less of experience
in ESL. The second has two and the third has three
years' experience. Two empirical results recommend
partitioning years of experience: (a) the proportions of
students on the test floor (Table 2) rise as a function of
years in program; (b) in New Jersey, students who have
more program experience outscore those with less
experience in English with reference to monolingual
English norms. All groups are tested in October and
again in April.

Such a design may be analyzed via ANOVA or a general
linear model performed on NCEs derived from
monolingual English norms. If one uses the norms of a
language proficiency test, the proxy control group is
probably a monolingual English sample. A
between-subjects variable is program experience,
which we will call Factor A. The effect of ESL
experience estimates discriminant validity of the test, in
the sense of Thorndike's formulation of variance
between groups defined by English proficiencies.

All possible outcomes of this analysis are considered in
terms of significant effects. A discriminant test will
yield significant differences among groups based on
program experience. A nondiscriminant test will not be
sensitive to group proficiency differences.
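The mixed design can be made concrete with a small hand computation. The sketch below is a minimal illustration, assuming equal numbers of students per experience group and exactly two occasions (pretest and posttest). It uses the conventional partition for one between-subjects factor (A, program experience) and one within-subjects factor (B, pre-post), with A tested against subjects within groups (S/A) and B and A x B tested against the subject-by-occasion residual (SB/A). The function name and the NCE values are invented.

```python
def mixed_design_f(groups):
    """F ratios for a two-occasion mixed design.

    groups maps a label to a list of (pretest, posttest) pairs;
    every group is assumed to hold the same number of students.
    """
    a = len(groups)                        # levels of Factor A
    n = len(next(iter(groups.values())))   # students per group
    b = 2                                  # pretest and posttest
    scores = [x for subj in groups.values() for pair in subj for x in pair]
    grand = sum(scores) / len(scores)

    g_mean = {g: sum(sum(p) for p in subj) / (n * b) for g, subj in groups.items()}
    o_mean = [sum(p[j] for subj in groups.values() for p in subj) / (a * n)
              for j in range(b)]
    c_mean = {g: [sum(p[j] for p in subj) / n for j in range(b)]
              for g, subj in groups.items()}

    ss_a = n * b * sum((m - grand) ** 2 for m in g_mean.values())
    ss_s_a = b * sum((sum(p) / b - g_mean[g]) ** 2
                     for g, subj in groups.items() for p in subj)
    ss_b = a * n * sum((m - grand) ** 2 for m in o_mean)
    ss_ab = n * sum((c_mean[g][j] - g_mean[g] - o_mean[j] + grand) ** 2
                    for g in groups for j in range(b))
    ss_total = sum((x - grand) ** 2 for x in scores)
    ss_sb_a = ss_total - ss_a - ss_s_a - ss_b - ss_ab   # residual

    ms_s_a = ss_s_a / (a * (n - 1))
    ms_sb_a = ss_sb_a / (a * (n - 1) * (b - 1))
    return {
        "A": (ss_a / (a - 1)) / ms_s_a,
        "B": (ss_b / (b - 1)) / ms_sb_a,
        "AxB": (ss_ab / ((a - 1) * (b - 1))) / ms_sb_a,
    }


# Invented NCEs: (pretest, posttest) for three students per group.
nce = {
    "first year": [(10, 14), (15, 18), (20, 26)],
    "second year": [(25, 30), (30, 33), (35, 41)],
    "third year": [(40, 45), (45, 47), (50, 57)],
}
f_ratios = mixed_design_f(nce)
print(f_ratios)
```

In these invented data the groups differ and all gain at about the same rate, so the experience and pre-post ratios are large while the interaction ratio is small. Significance would still have to be judged against F tables with the appropriate degrees of freedom.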

Two models of outcomes are investigated: results of the
totally nondiscriminant test and results of the totally
discriminant test. The utility of placement tests normed
on monolingual English samples for evaluation
purposes will be examined under these extreme
models. Considerations for the evaluation utility of the
instrument under both extremes will apply to the
evaluation utility of placement tests currently used by
programs.

The criterion for utility under the extreme cases is
whether or not the outcomes are meaningful in terms of
second-language acquisition theory. If the theoretical
factors which determine these outcomes are consonant
with the ESL program, then the test is probably
sensitive to the intervention. If they are not, then the
instrument may be measuring extraneous variables.

For example, students may regress in English skills,
relative to the monolingual English norms, if intensive
English is introduced before the students have
developed first-language competencies (Cummins,
1981). If first-language skills are assessed and
strengthened and precautions are taken against this,
then the test may be measuring other extraneous
factors, and its utility for evaluation must be questioned.

Finally, the totally discriminant test is one in which
there is no overlap in scores on the pretest between
English-proficient and first-year ESL students. A totally
nondiscriminant test cannot distinguish these two
groups.


Definition of Possible Outcomes 

Table 3 and Figure 1 show the possible outcomes of the
evaluation design described above. The four conditions
represent variations of significant effects for totally
nondiscriminant and totally discriminant instruments.
Table 3

Significant Effects in
Illustrative Evaluation Model

                                      Significant
                                      Within-Subjects
Condition                             Effects             

1    Totally Nondiscriminant(1)       Pre-post            Large between-subjects error
     Totally Discriminant(1)          Pre-post            Program effectiveness

2    Totally Nondiscriminant          Pre-post x          First language deficiencies
                                      Experience          in younger group
     Totally Discriminant             Pre-post x          Same
                                      Experience

3    Totally Nondiscriminant          Pre-post,           Individual differences in
                                      Pre-post x          growth patterns affected by
                                      Experience          ESL experience
     Totally Discriminant             Pre-post,           Course of curriculum
                                      Pre-post x
                                      Experience

4    Totally Nondiscriminant          None                Instrument failure
     Totally Discriminant             None                Instrument failure

(1) Totally nondiscriminant tests do not distinguish groups by years of program experience.
Totally discriminant tests yield significant program experience effects.



Figure 1

Possible Evaluation Outcomes of
a Proficiency Test in a Mixed Design
(Pretest-Posttest X Program Experience)

[Figure panels: Totally Nondiscriminant Test; Totally Discriminant Test]



It is assumed that more experienced ESL students will
score higher on highly discriminant tests. As students
gain ESL experience, the distribution of their English
proficiencies has greater overlap with the scores of
monolingual English students.



Condition 1

Totally Nondiscriminant Case. Only pretest to posttest
effects are significant. This would be an unlikely
outcome for an instrument that could not distinguish
students of different proficiencies. A large
between-subjects error (S/A), but relatively small
within-subjects error (SB/A), could account for this result.

Factors that are not partitioned could contribute to the
between-subjects error. This agrees with Cummins'
(1979) belief that subject factors influence language
proficiency. For example, socioeconomic status
significantly predicted English acquisition among
students in New Jersey (DeMauro, 1981). Another
source of unmeasured variance, especially among
younger students, is whether or not they have
developed requisite linguistic skills in their first
language. Such unmeasured sources may inflate error
and cause underestimates of program effects.

Totally Discriminant Case. This occurrence is very
plausible. Each group makes about equivalent pretest to
posttest gains, and the more experienced groups score
higher than the less experienced groups.




In the New Jersey study, Level II (grades 3-6) scores
yielded results similar to these. Although other
variables (language, socioeconomic status and grade
level) were partitioned, there were significant pretest to
posttest gains and students with more program
experience outscored students with less program
experience.



Condition 2

Totally Nondiscriminant Case. The amount of gain here
depends upon program experience. A clear example of
such an outcome is a crossover effect (see Figure 1). A
loss in NCEs represents less growth than could be
expected from the growth of the monolingual English
normative sample. Such an unlikely interaction might
occur if English skills development actually regresses
relatively, given certain sociocultural or cognitive
factors.

Cummins (1979) accounts for such cases in his
developmental interdependence hypothesis. He
believes that second-language competence depends on
first-language competence at the time of intensive
exposure to the second language.

If the group being evaluated is from one early grade, the
pretest to posttest relative loss may result from
intensive English exposure before the child had
sufficient first-language competencies. (Third-year
children may have been exposed in preschool.) Snow
and Hoefnagel-Hohle (1978) report that first-language
fluency decreases in young subjects on intensive
exposure to the second language. Competence in the
second language would then also suffer if the two are
interdependent.

If the group which regresses is the first-year group, the
results might explain program effects. Perhaps this is a
bilingual program which stresses initial strengthening
of first-language skills, and the test measures English
surface skills. There may be an initial relative loss in
those skills (first year), followed by small increases or
decrements during the second year, and a growth spurt
during the third year, as students begin developing
deep structure competencies. Cummins (1981) claims
that language minority students approach norms more
easily in surface proficiencies than in "context-reduced"
or academic proficiencies. Such an outcome would
indicate sensitivity of the instrument to surface skills,
but not necessarily academic proficiencies. The
instrument should be used if these skills are of interest,
but academic performance in English will probably not
be predicted by these surface skills.

Totally Discriminant Case. Significant between-groups
and interaction effects would appear as a fanning (see
Figure 1). The developmental interdependence
hypothesis might explain these. Students with poor
first-language skills might continue to fall further
behind monolingual English peers, while those with
better first-language skills would grow quickly. If all
students were in the same grade, those with more
program experience may not have had the opportunity
to develop requisite first-language skills.

One must be cautious that the instrument is measuring
language proficiency over the full scoring range. For
example, among higher ability students, the instrument
should not measure another construct like creativity.



Condition 3

Totally Nondiscriminant Case. There are significant 
pretest-posttest and interaction effects. Although there 
is significant growth relative to monolingual English 
norms, there is no between-groups effect. Large 
between-subjects error variance would explain this.
Snow and Hoefnagel-Hohle (1978) have reported the 
possibility that although second-language acquisition is 
predicted by patterns of first-language acquisition, 
there are large individual differences in these patterns. 
The interaction effect would indicate that such patterns 
are mediated by amount of ESL experience. 

If there are growth types that characterize groups of
children, then these groups should be analyzed
separately. Results from tests that are sensitive to
varied growth patterns may be difficult to analyze. A
structured, sequenced ESL curriculum may reduce this
individual variation. Caution should be taken that the
instrument is sensitive to the curriculum.



Totally Discriminant Case. There are many explanations
for all effects being significant. Some are appropriate to
specific age groups.

Walberg, Hase and Rasher (1978) present results that
would explain the circumstance in which the most
experienced group is gaining least. They report that
rapid gains in teacher ratings of the English skills of
Japanese children often diminish over time. This occurs
when the children are placed in a "homogenous"
second-language environment, similar to how Cummins
(1979) describes submersion programs.

If the smallest gains are associated with less program
experience, the outcome may reflect a curriculum
which strengthens the native language skills of younger
students in the first program year(s) before English is
introduced to the classroom. More experienced children
would appear to gain more during the same
pretest-posttest interval.

If the largest gains are made by the middle experience
group, it may be that the more experienced students
have attained a ceiling on the test which has not yet
been reached by the middle experienced students.
Alternately, less experienced students may have
initially shown English growth during a period of
first-language strengthening. After a growth spurt in a
later year, the students may begin to approach the test
ceiling.

These explanations depend on English growth being
curvilinear. Sampling students with different years of
experience approximates sampling different points of
the growth curve. Figure 2 illustrates this.



Figure 2

Hypothesized Curvilinear Functions
of English Growth

[Figure: growth curves labeled First Year, Second Year and Third Year, plotted from Pretest to Posttest; panel label: Longitudinal Results]



One last explanation for the phenomena is that all
program students in a grade received the same
curriculum, regardless of program experience. This
curriculum would have differential utility to the
students, depending on their experience. An instrument
sensitive to these effects would provide useful
evaluation information and should be used.



Condition 4

Totally Nondiscriminant Case. Condition 4 could result
from large individual variations in growth, which result
in violation of the statistical additivity model for
repeated measures (Myers, 1972). Perhaps the
instrument measures skills that are unrelated to
program exposure. If there are no systematic factors of
interest to be partitioned (sociocultural factors, sex,
etc.), the test should not be used for evaluation
purposes. Alternately, the curriculum may not be
helping students learn English.



Totally Discriminant Case. If the test is sensitive to
differences related to program experience, it is not likely
to be insensitive to growth within a year. Large
within-subject error variance might account for this.
Again, individual differences in growth rates may make
the statistical model very conservative. If there are
effects of interest that can be partitioned, this may
reduce error variance. If not, the test is probably not
very valuable for evaluation.

In the New Jersey study, significant effects were
attributed to program experience in Levels I and III,
without significant pretest to posttest effects. In Level I
(grades K-2), grade level interacted with pretest to
posttest growth. Second graders made significant gains
relative to monolingual English norms, while first
graders did not. In Level III (grades 7-12), the LAB did
not reveal any patterns of interest in pretest to posttest
growth.



Interpretation

The first decision to be made is whether the
instrument's discriminant validity warrants its use for
placement. Some manuals (for example, Language
Assessment Scales, DeAvila and Duncan, 1975;
Miranda Test of English Proficiency, Miranda and
Associates, Inc., 1980) address this issue. Instruments
that are valid for placement might then be considered
for evaluation uses.

In the examples above, discriminant validity is
estimated by how well more experienced ESL students
are statistically separated from less experienced ESL
students, relative to monolingual English norms. By
current language acquisition theory, more experienced
students would only score lower relative to these norms
if they have been exposed to English before they were
sufficiently competent in their native languages. When
the level of linguistic competence is controlled, a test
that is homogenous over the domain should, by the
linguistic interdependence theory, yield higher scores
for more experienced students.

One solution for using monolingual English norms to
evaluate ESL programs is to partition groups by
program experience. In this way, the standard for each
group is not the performance of monolingual English
peers, but rather, the relative performance of other
program students, as they gain academic experience in
English.

The evaluation hypothesis is that students in their last
program year(s) will outscore students in their first
program year(s), relative to monolingual English peers.
By looking at pretest to posttest growth within
experience cohorts, observations may be made about
rates of growth, as well.

One final caution in using the monolingual English
norms deals with the violation of the statistical
assumption of normal distribution. These data will
almost always be positively skewed. Table 1 shows the
skews for pretest and posttests are homogenous within
grades and experience levels. Such violations of the
assumption slightly reduce the probability of
computing large F-ratios (Myers, 1972).

Of course, the benefit of using NCEs based on
monolingual English norms is that they have intrinsic
meaning. These scores estimate student performance in
English relative to that of mainstream students.



Table 4

Checklist for Test Use
in ESL Evaluation

The following checklist provides a review sheet for
programs to use in estimating whether a placement
instrument will be useful for evaluation purposes.

Assurances                                 Related Condition

1. Content validity, homogeneity over
   domain                                  All

2. Measure and/or partition related
   sociocultural and other student
   factors                                 1, 2, 3

3. Measure first-language competencies     2

4. Measure "context-reduced" language
   skills                                  2



Totally Discriminant Test-Failure to Discriminate
Experience Groups

The above conditions assume that totally discriminant
tests would distinguish more-experienced from
less-experienced groups. This assumption implies that
there were monolingual English students in the
normative sample who scored lower on the instrument
than some of the LEP students. If there were no overlap
in scores, LEP students of all levels of program
experience would score an NCE of 1.
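
The floor effect described above can be illustrated with a short sketch. It assumes the conventional definition of the normal curve equivalent (mean 50, standard deviation 21.06, bounded at 1 and 99) and takes a percentile rank in the monolingual English norm group as input; the function name is hypothetical and is not taken from any test manual.

```python
from statistics import NormalDist

def percentile_to_nce(percentile_rank):
    """Convert a norm-group percentile rank (between 0 and 100, exclusive) to an NCE.

    Assumes the conventional NCE scale: 50 + 21.06 * z, bounded at 1 and 99.
    """
    z = NormalDist().inv_cdf(percentile_rank / 100.0)
    nce = 50.0 + 21.06 * z
    # Scores below the lowest norm-group scores all collapse to the floor of 1.
    return min(99.0, max(1.0, nce))

# A student at the norm-group median, and two students far below the norm group.
median_student = percentile_to_nce(50)
low_student = percentile_to_nce(0.01)
lower_student = percentile_to_nce(0.001)
```

In this sketch the two low-scoring students receive the same NCE of 1 although their percentile ranks differ, which is the loss of discrimination the text warns about.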

When floor effects are extreme, the monolingual
English norms should be abandoned. A scale derived
from LEP student scores would better serve evaluation.
This should certainly be done if the high proportion of
LEP students scoring a 1 does not improve with
increases in program experience.

Raw pretest scores for each grade, within years of
experience, should undergo normal transformation.
Posttest scores could be converted to the same scale by
subtracting the pretest mean from each and dividing by
some function of the pretest standard deviation.
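
A minimal sketch of this conversion follows. It assumes a simple linear standardization in which the "function of the pretest standard deviation" is the standard deviation itself; a full normalizing (area) transformation would be a different procedure. The function names and the scores are hypothetical.

```python
from statistics import mean, stdev

def pretest_scale(pretest_scores):
    """Return (mean, sd) of raw pretest scores for one grade-by-experience group."""
    return mean(pretest_scores), stdev(pretest_scores)

def to_pretest_scale(raw_score, pre_mean, pre_sd):
    """Express a raw score in pretest standard deviation units."""
    return (raw_score - pre_mean) / pre_sd

# Hypothetical raw pretest scores for one grade and year of experience.
pre_mean, pre_sd = pretest_scale([10, 12, 14, 16, 18])

# A posttest raw score of 17 expressed on the pretest scale.
posttest_on_pretest_scale = to_pretest_scale(17, pre_mean, pre_sd)
```

A positive value indicates a posttest score above the pretest mean of the student's own grade and experience group.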

For each grade, a spring scale could be developed for
each of three (or more) years of program experience.
Transformations of scores for students entering in the
fall could be interpolated. The interpolated value would
fall between one transformation using the mean and
standard deviation of first-year students in the students'
current grade levels and another transformation using
the mean and standard deviation of first-year students
in the previous grade level.
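
The interpolation might be sketched as follows. The text does not say how the two transformations should be weighted, so the weight below (0.5, the midpoint) is purely an assumption, as are the function names and the illustrative means and standard deviations.

```python
def standardize(raw_score, group_mean, group_sd):
    """Linear transformation of a raw score using one group's mean and sd."""
    return (raw_score - group_mean) / group_sd

def fall_entrant_score(raw_score, prev_grade_stats, curr_grade_stats, weight=0.5):
    """Interpolate between two spring-scale transformations for a fall entrant.

    prev_grade_stats and curr_grade_stats are (mean, sd) pairs for first-year
    students in the previous and current grade. weight is the assumed position
    of the fall testing date between the two spring scales (0 = previous grade
    scale, 1 = current grade scale).
    """
    z_prev = standardize(raw_score, *prev_grade_stats)
    z_curr = standardize(raw_score, *curr_grade_stats)
    return (1 - weight) * z_prev + weight * z_curr

# Hypothetical first-year statistics: previous grade (18, 4), current grade (24, 4).
fall_score = fall_entrant_score(20, (18, 4), (24, 4))
```

With these illustrative numbers the raw score of 20 lies half a standard deviation above the previous-grade mean and one standard deviation below the current-grade mean, so the midpoint interpolation gives -0.25.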

This is a once-a-year testing model, except for two
groups: 1) first year fall entrants who would be tested
on entrance and pretested in the spring; 2) students
who change test levels and would have to be pretested
in the fall of the year of change. The fall score would
be an interpolated transformation similar to that used for
fall entrants.

Each spring's scores would serve as a posttest for the
concluding year and pretest for the coming year. These
two functions could be served using two
transformations. As a posttest, scores would be
transformed based on the statistics of the previous year
of experience, and as a pretest for the coming year, the
transformation would use statistics for the coming year
of experience. For example, spring pretest scores for
first-year students would be transformed based on
means and standard deviations for first-year students.
Their posttest scores the following spring would be
transformed using first-year mean and standard
deviation for the next grade level. This same score could
be used as a pretest for the coming year by
transforming it based on the second-year mean and
standard deviation in that grade level.

In this manner, the normalized scores would always
control for grade progression without being
confounded by identical program interventions (as with
LEP norms) between treatment group and normative
samples. Separate statistics for transformations need
not end at three years of experience, but large enough
samples with more experience may not be available to
yield stable means and standard deviations. Table 5
shows this.

Table 5

Choice of Appropriate Scales
(Example)

        Scale
Grade   Experience   Group, Test, Experience, Grade

1       1            Group 1, pretest, first year, first grade
        2            Group 2, pretest, second year, first grade
        3            Group 3, pretest, third year, first grade

2       1            Group 1, posttest, first year, second grade
        2            Group 2, posttest, second year, second grade
                     Group 1, pretest, second year, second grade
        3            Group 3, posttest, third year, second grade
                     Group 2, pretest, second year, second grade

New Jersey data suggest that growth is a function of
grade level. In high school, the gains made in different
grades are small and approximately equal. It may be
more efficient to develop norms for whole grade
clusters, e.g., 10-12, rather than within separate grades
for these students.



Finally, the suggested scales are not without problems.
They do not control for cultural exposure to English
from the media, friends, etc.* However, this procedure,
especially for more experienced students, may be the
best approximation of English-skills growth as a
function of program exposure.



Nonnormative Meanings of Test Results 

Some available proficiency instruments, e.g., the
Bilingual Syntax Measure (Burt, Dulay and Hernandez,
1975) and the IDEA Oral Language Proficiency Test
(Ballard and Tighe, Inc., 1979), provide results that are
meaningful in terms of linguistic theory. These
instruments hypothesize taxonomies of language skills
and plot student progress along these theoretical
structures (levels, etc.).

Meaning of results from normed tests usually comes
from the norms, but other meanings might be given by
relating student skills to the test results (Angoff, 1971).
For example, teachers could group test scores (every
five points) and record report card grades in ESL for
each student achieving the scores within every group.
The abilities associated with the modal course grade for
each group of scores would then be associated with
those test scores.
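
The grouping procedure can be sketched briefly. The function name, the letter grades, and the scores below are hypothetical; the sketch only shows how test scores blocked into five-point intervals may be paired with the modal course grade in each interval.

```python
from collections import Counter, defaultdict

def modal_grade_by_band(records, width=5):
    """Map each score band to its modal course grade.

    records is an iterable of (test_score, course_grade) pairs. Bands are
    identified by their lowest score, e.g. 40 for scores 40 through 44.
    """
    bands = defaultdict(Counter)
    for score, grade in records:
        band_start = (score // width) * width
        bands[band_start][grade] += 1
    # The most common grade within each band is taken as that band's meaning.
    return {start: counts.most_common(1)[0][0]
            for start, counts in sorted(bands.items())}

# Hypothetical (test score, ESL report card grade) pairs.
records = [(41, "C"), (43, "C"), (44, "B"), (47, "B"), (48, "B"), (52, "A")]
grade_meaning = modal_grade_by_band(records)
```

With real data, bands containing few students or tied grades would need to be inspected before any meaning is attached to them.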


Similarly, Firidley and Nathan (1979) propose methods
of evaluating ESL programs from a performance
perspective. Teachers could rate whether or not
students have mastered various performance objectives
at the time of testing. Again, test scores might be
blocked into five-point intervals, and an assessment
may be made of whether or not most students scoring
in each interval have mastered each objective. Lists
may be made of mastered objectives associated with
each scoring range.

Angoff also suggests that such methods may be
reversed. Mastery of performance objectives and/or
course grades could be standardized by showing the
distribution of test scores associated with each
objective mastered or different course grades. The
modal test score (blocked in small intervals) would
represent the score associated with each mastery or
course level.


Hambleton and Cook (1977) explain that an ability scale
could be determined using latent trait models. Scores
on this scale are related to the probability of passing
each test item and, therefore, have meaning in terms of
test content. Programs with large populations might
consider similar applications of latent trait models.
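
As an illustration of how an ability score relates to item content, the sketch below uses the one-parameter logistic (Rasch) model. This is only one of several latent trait models and is chosen here as an assumption; the item labels and difficulty values are hypothetical.

```python
import math

def p_pass(ability, difficulty):
    """Probability of passing an item under the one-parameter logistic model."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

def item_profile(ability, difficulties):
    """Probability of passing each item for a student at a given ability score."""
    return {item: p_pass(ability, b) for item, b in difficulties.items()}

# Hypothetical calibrated item difficulties on the ability scale.
difficulties = {"easy item": -1.0, "medium item": 0.0, "hard item": 1.5}

# Expected performance of a student whose ability score is 0.0.
profile = item_profile(0.0, difficulties)
```

A student whose ability equals an item's difficulty has an even chance of passing it, so each ability score can be described by the items a student at that level would probably pass.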

This option might also be used in reference to
monolingual English norms. The test could be
calibrated to describe proficiencies of monolingual
English students at each ability score. LEP students
attaining various ability scores would have
proficiencies similar to those of the normative student.

*Programs concerned with these factors might employ three-point models in which intervals spanning academic years are compared with summer
intervals spanning vacation periods.

Finally, item calibration may be used both in
criterion-referenced testing and normative testing
(Wright, 1977). Items may be drawn from banks that are
tailored to students' proficiencies and referenced either
to norm or performance criteria.



Summary and Conclusions 

Placement tests and ESL evaluation tests both measure
English proficiencies. The placement tests must
determine how well students could perform in English
in the academic setting. Since monolingual English
students are normally instructed in this environment,
placement instruments are reasonably normed on
monolingual English school populations.

It is also reasonable that these instruments might be 
used to evaluate the ESL instruction, either as a 
component of bilingual education or as a program itself. 
Such use of these tests raises the question of whether 
test norms may be used as proxy comparison groups. 
The growth of monolingual English students, however, 
may not be an appropriate standard to gauge the 
growth of LEP students. Other norms based on LEP
students are also problematic.

A proposal for using monolingual English norms, in
which LEP students are grouped by program
experience, is offered. The scores of each group relative
to the monolingual English norms are compared. Such
analyses are somewhat conservative. This use of
monolingual English norms relies heavily on a control
(or at least, measurement) of first-language skills before
intensive introduction to English and a review of the
homogeneity of the test.

Finally, Angoff (1971) observes that norms acquire 
meaning through familiarity. In this sense, monolingual
English norms are valuable because their meaning is 
immediate. They are used in placement to define how 
LEP students would perform in monolingual classrooms. 



The placement and evaluation uses of these norms may
not have different meanings. The placement test tells
how far a student must go in the ESL program. The
evaluation test marks his/her progress on the route. The
placement test is merely one point along the route.
Moreover, each test should have a meaning in terms of
curriculum. The placement test should prescribe areas
of particular linguistic need, and the evaluation test
should mark student improvement in those areas. Both
of these functions demand instrument sensitivity to the
same student capacities.

Finally, we have discussed these instruments as though
they yield a single score for normative comparisons.
Actually, instruments may provide separate reading,
writing, speaking and listening norms. In this case, the
arguments would apply to each of these areas
separately. Each subtest should be homogenous, etc.;
placement could be accomplished through profiles
across the four areas. This would help target curriculum
to a particular student's needs. Separately normed
subtests would also enable teachers to evaluate a
specific student's improvements in the targeted needs
areas.

Gains in NCEs on the Language Assessment Battery
by Bilingual Education Students in New Jersey

The multifaceted, multidimensional fabric which
comprises the individual and shared domains of
language and cognition gives rise to a complexity that
beckons the researcher to investigation. Language and
cognition, as well as their interaction, have been
intensely examined by the research community. In spite
of this vigorous investigation, many issues still warrant
clarification. What role does language play in
cognition? How do language and thought interact?
These questions have created major controversies and
resulted in voluminous reports, yet the controversies
and questions still remain. The focus of this paper is on
the interaction of language and thought, more
specifically the role that language plays in the human
thought process as seen from the perspective of
bilingualism.



Theoretical Background 

What role does language play in thinking? Is language a
mandatory prerequisite for logical thought? These
questions have been intensely debated, and the
opposing views of Bruner and Piaget regarding the
interrelationship of language and thought have
emerged and dominated the arena of psycholinguistics.

Piaget (1967) views language as an important facilitator
of thought, but not as a foundation or a necessary
prerequisite to cognition. Cognition, in general, and
higher level logical thinking, in particular, do not have
their roots in a linguistic base, but rather in the sensory
interaction with the environment. As Inhelder and
Piaget (1964, p.293) note,

Whether a child understands words like "all"
and "same" or any other form of words used to
refer to the concept of class inclusion and
similarity, whether he understands the sort of
language we use to refer to the asymmetrical
and transitive relation of a series, these are
questions that are mainly dependent on the
level which he has reached in the development
of operational behavior and that development
is relatively independent of any other,
because it is governed by its own laws of
equilibration. We, therefore, say that language
is not a sufficient and necessary reason for the
process.

Thus, language is clearly not given the status of a
mandatory requirement of thinking. Even the view,
held by many, that the abstract logical thinking
developing in the formal operational stage of cognition
is synonymous with or at least embodied in language
is clearly rejected by Piaget. Piaget states:



... in spite of appearances and current opinion,
the essential characteristic of propositional
logic is not that it is verbal logic. First and
foremost, it is a logic of all possible
combinations, whether these combinations
arise in relation to experimental problems or
purely verbal questions ... the real power of
propositional logic lies not in this (verbal)
support, but rather in the combinational power
which makes it possible for reality to be fed
into the set of possible hypotheses compatible
with the data (Inhelder and Piaget, 1958, p.253).

In contrast, Bruner (1964; 1966) opposes Piaget's
model of the independence of thought and language.
Bruner perceives language to be closely intertwined
with cognition and as an underlying requisite to logical
thinking. Bruner (1964) postulates that intellectual
growth in man can only occur as a result of his
representational and integrative abilities. In sustaining
cognitive growth, one must be able to represent in an
understandable fashion recurring events and
regularities within his environment, as well as integrate
past events with present. Bruner (1964) further
hypothesizes three developmental stages of cognitive
growth which are in fact developmental technologies or
modes of internal representation of the complex
environment which defines the existence and
parameters of cognition. The first means of
representation of which the child avails himself is the
enactive representation, which embodies a motoric
representation of past events. Bruner (1964, p.2) cites
the following example:

We cannot, for example, give an adequate
description of familiar sidewalks or floors over
which we habitually walk, nor do we have
much of an image of what they are like. Yet we
get about them without tripping or even
looking much. Such segments of our
environment — bicycle riding, tying knots,
aspects of driving — get represented in our
muscles, so to speak.

This is followed by an iconic representation where the
individual can represent events via perceptual imagery
and the selective organization of these images. The
most advanced representational technique is a
symbolic representation of events. It is in this stage that
Bruner postulates language to be the means enabling
the formation of symbolic representations and to be the
primary force propelling the child from the static iconic
stage to the more flexible symbolic representation.

Thus, it is via language that we form symbols, and
language and cognition become inseparably
intertwined, with the latter's very existence dependent
on the former. Bruner (1964, p.4) claims,

Once the child has succeeded in internalizing
language as a cognitive instrument, it becomes
possible for him to represent and systematically
transform the regularities of experience
with far greater flexibility and power than
before.

He continues, 

In children between 4 and 12, language comes
to play an increasingly powerful role as an
implement of knowing. Through simple
experiments, I have tried to show how
language shapes, augments, and even supersedes
the child's earlier modes of processing
information. Translation of experience into
symbolic form, with its attendant means of
achieving remote reference, transformation,
and combination, opens up realms of intellectual
possibility that are orders of magnitude
beyond the most powerful image-forming
system. (Bruner, 1964, p. 13)

Research evidence has been presented in support of
both sides of this theoretical controversy. Bruner (1964)
gives evidence that language can enable the
transitional child to conserve if initially shielded from
the perceptual distortion and allowed to verbally
represent the situation prior to visually perceiving the
event. Thus, the symbolic representation of language
allows the child to surmount his being perceptually
bound to the situation, giving credence to the critical
role of language in thinking. However, when Bruner's
concept that thought has a linguistic base was
extended to suggest that language training can develop
conservation ability, the results of a study by
Sinclair-de-Zwart did not support this hypothesis. It was
determined that cognitively advanced children who
demonstrated conservation ability did in fact possess a
linguistic sophistication employing more relational
terms, more highly differentiated terms and coordinated
descriptions. However, training children to use these
linguistic expressions did not improve their
conservation ability, which bolsters Piaget's view of
language as a reflection of thinking, rather than the
basis for it.

Similarly, Furth (1966) compared the cognitive
development of hearing children and deaf children who
have no linguistic input and who develop no, or limited,
linguistic ability. Bruner's theoretical perspective would
dictate that deaf, linguistically limited children should
display deficient cognitive growth, yet no substantive
differences were noted, supporting Piaget's contention
that thought is not founded in language.

In this paper, similar to Furth (1966), the ongoing
controversy of the role of language in cognition will be
examined by investigating the cognitive development
of a special population: bilinguals. The theoretical
premise the reader is asked to accept is as follows: if
Piaget's theoretical perspective regarding the
independence of language and thought is correct, one
would expect that bilinguals and monolinguals, despite
their different linguistic development, should not
display cognitive differences, since language is not the
basis for thought and even complex logical processes
have their roots in environmental interaction as
opposed to language. If, however, Bruner's theoretical
stand on this issue is correct, and in fact language and
thought are inseparable, with language
being the core of the symbolic mode of representation,
then the wider linguistic range and more diversified
linguistic structures of the bilingual child should have
an enhancing effect on his cognitive development. The
bilingual child should display a qualitative or
quantitative cognitive difference compared to
monolinguals in the embedment of more advanced
abstract or symbolic thought forms or in the display of a
more rapid cognitive development.

Research Evidence 

If intelligence tests are valid, research relating IQ scores
and bilingualism can be brought to bear on this
controversy. Piaget would maintain the expectation of
no significant difference in IQ, while Bruner would
expect bilinguals to attain higher IQs because of their
advanced linguistic and cognitive development.
However, as was noted in a previous paper
(Kirschenbaum, 1982), the results of correlational
studies of bilingualism and IQ run the gamut from
intellectual deficiencies to intellectual enhancement.
The contradictory findings, as well as the very limited
generalizability of the results of most studies
investigating IQ and bilingualism, were largely
attributed to gross methodological inadequacies which
were outlined in that paper. (The reader desirous
of a comprehensive critical analysis of this area should
see Kirschenbaum, 1982.)

The study by Peal and Lambert (1962) is one that does
warrant further discussion because of its distinctive
status of being methodologically rigorous, employing
balanced bilinguals, adequate controls, and appropriate
standardized verbal and nonverbal intelligence tests in
the proficiency language of the bilingual. (The only
methodological criticism that is applicable is the lack of
longitudinal data.) This study engaged 10-year-old
monolingual and balanced bilingual French-Canadians
and measured intelligence on both verbal and
nonverbal IQ tests. Moreover, this study not only
quantitatively computed the IQs, it also qualitatively
compared the intelligence of monolinguals and
bilinguals, finding several qualitative differences. In
analyzing the performance subtests, it was determined
that more than simple visual motor activity was
involved in those subtests in which bilinguals
surpassed monolinguals. These subtests required a
symbolic reorganization involving abstract concept
formation and an ability for quick cognitive
reorganization and flexibility. Peal and Lambert (1962)
hypothesized that because bilinguals have two words
for each referent, i.e., two symbols for each object,
bilinguals must conceptualize their environment in
terms of abstract general qualities. This is especially
true for the compound bilingual who learns both
languages in the same setting and, therefore, with the
same referents. Peal and Lambert (1962, p. 14) state:

This ability to think in terms of abstract
concepts and relations, independent of the
actual word, apparently is required in the
symbolic reorganization type tests. The
monolinguals may never have been forced to form
concepts or abstract ideas of things and may
be more likely to think in terms of concretes.
They could not be expected, therefore, to be as
agile at concept formation as the bilinguals,
and they might appear handicapped
comparatively.

Additionally, the increased cognitive flexibility noted in
bilinguals may be rooted in their consistent alternating
from one language to the other. Similarly, in the
performance tests administered in this study, a
cognitive flexibility is required since symbolic
reorganization often calls for substituting one concept
for the other.

A factor analysis of the bilingual and monolingual 
performance on the intelligence test also revealed that 
the bilinguals developed more independent abilities and 
a more diversified structure than monolinguals. This 
fact takes on extreme significance in light of Guilford's
(1959; 1967) definition of intelligence as a general 
factor composed of smaller specific factors based on 
the person's experience. Thus, not only does the 
bilingual's cognitive structure differ, but this difference 
engenders an intellectual superiority when compared to 
the monolingual. 

How does one explain these cognitive advantages
observed in the bilingual child? Bruner can explain
them easily, citing language as underlying thought:
the wider linguistic range and structure of the bilingual
enhances his use of the symbolic representational
mode. However, how do Piagetians deal with these
results? In attempting to reconcile these results with
Piaget's theory, one cannot quickly dismiss the criticism
that this study is not longitudinal. Since this research is
not longitudinal and represents correlation at a
particular isolated moment, one cannot assume
causality. Thus, we cannot determine if bilinguals are
more intelligent because of their bilingual nature, or
whether more intelligent, cognitively flexible people
with a greater abstract concept formation ability tend to
become bilingual.

There is, however, additional experimental evidence
supporting the hypothesis that the cognitive
development in the bilingual child is different from that
of the monolingual child. Leopold (1971, p.14) found a
"noticeable looseness of the link between the phonetic
word and its meaning." Leopold (1939) conducted a
longitudinal observation of his German-English
bilingual daughter from the time she exhibited speech.
Leopold observed that, unlike monolingual children, his
daughter did not insist on stereotyped wording in
stories, often substituting words in songs or rhymes.
This represented a distinction between words and
meanings, and thus Leopold (1939; 1971) claimed that
the bilingual developed an early, premature ability to
form concepts and a realization that words are not fixed
and concrete. Apparently bilinguals, as opposed to
monolinguals, do not have a concrete attachment to
words, but they do have an advanced ability to form
abstract concepts, a finding supported in Peal and
Lambert (1962).

This finding receives further support from Ianco-Worrall
(1970). This study attempted to test Leopold's
hypothesis of an accelerated cognitive development in
bilinguals, as specifically observed in the looseness
between words and meaning. A total of 30
Afrikaans-English bilinguals (from a one-language home
environment), divided in two age groups, 4-6 and 7-9,
were tested in a series of two experiments. An
equivalent group of English and Afrikaans monolinguals
was also tested. A multifactor determination of
bilingualism was conducted via mother interviews, a
test of bilingualism, and teacher reports.

The first experiment tested the semantic-phonetic
preference in the bilinguals and monolinguals. Each
child was given a set of three words: a standard word
and two choices, one which was semantically similar,
and one which was phonetically similar (but
semantically different). It was found that young
bilinguals could semantically match words to a
significantly higher degree than young (4-6)
monolinguals. Both groups of older children (7-9)
indicated semantic preference and chose to match
words on a semantic rather than a phonetic basis.

The second experiment tested the bilinguals' and
monolinguals' recognition of the arbitrary nature of
names and their ability to separate names and meaning.
In this experiment a questioning method of Vygotsky
was used. The following are sample questions.

1. "Why is a cow called a 'cow'?" For all groups,
bilingual and monolingual, the responses were
largely in terms of attributed qualities. For
example, "a cow is a cow because you milk it."

2. "If we were making up names, could we call a
dog a cow?" The results indicate that in both
the younger and older groups, a greater
number of bilingual children agreed in principle
that the names could be changed, while a majority
of monolinguals said they could not.

3. "Let's call a dog a 'cow.' Does this cow have
horns?" The response to this question
improved with age, but there was no
significant difference between monolinguals
and bilinguals.

It is concluded from the results of this study that
Leopold's observation was valid, and the bilinguals'
development indicates a two- to three-year
advancement over monolinguals. This also points to a
superior ability to form abstract concepts and
conceptual relationships because, on a continuum from
concrete to abstract, word meanings are more abstract
than word sounds. The bilinguals' ability to substitute
names in principle, as opposed to the monolinguals,
also exemplifies their cognitive flexibility and advanced
conceptual development.

Another study (Feldman & Shen, no date) supporting
the hypothesis that bilinguals have a different cognitive
development than monolinguals tested five-year-old
bilinguals' and monolinguals' ability in tasks involving
object constancy, naming, and the use of names in
sentences. It was assumed that since bilinguals have
two codes for all referents, this would facilitate their
awareness of the arbitrary nature of names. It was,
therefore, hypothesized that in the aforementioned
tasks, a bilingual child would display advanced
cognitive development, as opposed to a monolingual
child.

The object constancy task entailed showing an object
to the child and physically transforming it while viewed
by him/her (e.g., a match burned, a cup crushed). It was
then placed with an identical pre-transformed object,
and the child was directed to pick up the object he was
previously shown. The naming task consisted of the
child's ability to call objects by their common names
(call a cup a "cup"); to learn nonsense names (call a
cup a "wig"); and to switch common names (call a cup
a "plate"). In the sentence task, the child had to use
these three labels (common, nonsense, switched
common) in various sentences.

The results indicate that bilinguals did
significantly better on all three tasks, with the sharpest
disparity in comprehension measures as opposed to
production. (See Seidner, 1982.) A further analysis
reveals that bilinguals were equivalent to monolinguals
in their facility to acquire new names (nonsense labels)
and their general knowledge of common names.
However, bilinguals surpassed monolinguals in all other
areas of naming, using names in sentences, as well as
object constancy. Thus, it appears that bilinguals have a
superior cognitive structure in these areas of
development.

Further support for cognitive development differences 
between bilinguals and monolinguals is derived from 
the fact that bilinguals use more divergent thinking 
than monolinguals (Scott, 1973), which is considered 
an indication of creativity by several authors (Wallach & 
Kogan, 1965). In this study (Scott, 1973), divergent 
thinking was taken to be synonymous with a unique 
cognitive style, rather than a creativity predictor. Early 
divergent thinking was measured by the unusual uses 
subtests and late divergent thinking, by the lines and 
patterns subtests of the Wallach and Kogan tests. 

The results of this study indicated more divergent
thinking in bilingual subjects than monolinguals. Scott
(1973, p. 10) states, "The results of this study seem to
indicate that bilingualism can both influence and be
influenced by divergent thinking which operates as
both cause and effect." The author discusses the results
in terms of supporting the hypothesis that bilinguals
have greater cognitive flexibility than monolinguals.

However, not all the evidence points to this differential 
development in bilinguals. Burling (1971), in observing 
his bilingual son, claims he did not find the looseness of 
linking words and meaning that Leopold (1971) 
discussed. In the longitudinal observation of his son, he 
claims: 

There can never have been a child more 
obsessional in this respect than Stephen. When 
we read to him he insistently protested the 
slightest alteration in any familiar text. This 
was true even though we read to him in his 
second language, English, where he would 
suppose the form and meaning to be the least 
rigidly identified. It must have been an 
idiosyncratic trait rather than bilingualism that
freed Leopold's daughter from insistence upon
stereotyped wording (Burling, 1971, p. 185).

Another longitudinal study concerns the St. Lambert 
experimental school. This experimental school was 
designed to foster bilingualism by primarily instructing 
children in the second language. This experiment 
involves the experimental bilingual school, as well as 
English and French instruction schools used as controls.
The control and experimental groups were matched on a
number of variables. However, the basic design of this
study was to examine the feasibility of this
experimental school in fostering bilingualism, not to
examine cognitive differences of bilinguals and
monolinguals. Therefore, there may have been some
children who were to some extent bilingual in the
control groups, thus limiting the applicability of this
study. However, if bilingualism causes a differential
development, one may expect some cognitive
differences in children attending a school promoting 
bilingualism. 

In examining the cognitive development of children 
after grades one and two, no cognitive inferiority or
superiority over controls was reported. In a more
comprehensive report (Bruck, Lambert, & Tucker, 1973)
noting the scholastic and cognitive development over
six years, no cognitive advantage or disadvantage was
found after grades four, five, and six. (It should be noted
that on a test of cognitive flexibility in grades three, five
and six, the pilot experimental class performed
significantly better than controls. However, the authors
do not relate this superior performance to their bilingual
schooling because they did not obtain similar results for
follow-up classes.) 

Conclusion, Summary and Implications

In conclusion, refocusing on the previous theoretical 
discussion, Bruner's theory is bolstered by those studies 
showing bilingualism to have an enhancing effect on 
intelligence. In addition, there are many studies that go 
beyond a mere correlation with intelligence and 
examine the qualitative cognitive development of the
bilingual child. The bulk of those studies indicates a
differential cognitive development in the bilingual child
as opposed to the monolingual. These studies were
discussed in terms of viewing the bilingual child as having
greater cognitive flexibility, superior ability to form
abstract concepts, a more highly diversified cognitive
structure, greater divergent thinking ability, a less
concrete attachment to words, an earlier object
constancy development and less egocentricity. These
findings were anticipated according to Bruner's theory,
and therefore lend support to it.

The final remaining question is whether Piagetian theory can
explain the recent evidence indicating that bilingual
children have numerous cognitive advantages. There
are some indications, as noted, that these advantages 
do not exist, and the bilingual's and monolingual's 
cognitive development do not significantly vary. 
However, the bulk of the evidence presented in the 
latter part of this paper, seriously questions Piaget's 
view on language and cognition.

Introduction

Rotberg (1982) states that there are 3.6 million
school-aged children in the United States with a limited
ability in English. It is reasonable to assume that within
any given population there exists a normal distribution
of both giftedness and handicapping situations.
Assessment for giftedness, general achievement,
behavior disorders, and language or learning handicaps
is complicated when students come from homes in
which a language other than English is spoken.
Achieving an adequate assessment of bilingual
students has been and continues to be an area of
concern for educators. The limitations of existing
instrumentation have been discussed in terms of
cultural bias (James, 1981; Bernal, 1975), norming or
standardization procedures (Hilliard, 1981; Pletcher,
1978) and validity (Law, 1977; SWRL, 1980). Because
the topic of this paper deals with the use and misuse of 
instruments, those particular issues of concern to 
educators and practitioners will be addressed. 

This paper will discuss two areas: A) the purpose of
student assessment in terms of the referral question
based on the various kinds of testing including
screening, identification, student/program evaluation,
and exit or reclassification, and B) problems that exist
resulting in test misuse. For the purposes of this paper,
language proficiency testing will be emphasized
because it is within the framework of language testing 
that most programmatic decisions are made. 

Purpose of Student Assessment 

Important to consider in any discussion of testing is the 
function of tests. Unless the end result of testing is to 
improve the quality of educational service, the overall
value is questionable. Aran-Mendez (1975) states that 
"the most important function of the test is to obtain all 
possible information about the student so his/her 
learning could be guided and his/her growth promoted" 
(p. 86). 

Educators often ask experts to identify which is the
"best" test. There exists a fallacy regarding the inherent
goodness or badness of a test. A test's goodness or 
badness can only be considered in relation to two 
overlapping domains: logical and technical. The logical
domain deals with test appropriateness. A "good" test is 
one that is considered appropriate as it relates to the 
assessment need juxtaposed with the referral question. 
Invariably, students are referred for testing without an 
adequate problem statement accompanying the 
referral. It is essential for the evaluator to ask the
individual making the referral to articulate the variables
that led to the request that an assessment be
conducted. Questions to ask the teacher include: Was
there a question regarding language interference?
Would testing facilitate program entry or exit decision
making? Should the relationship be explored between
the child's first and second language proficiency? Is
testing necessary to aid in curriculum planning? These
and other questions are logical extensions rising out of
a clearly articulated need. There are literally hundreds of
good tests that are available for use. Any test, however,
is only as good as the information that it yields in
response to a pre-specified referral question.

The second domain to consider regarding test
goodness or badness deals with the technical aspects
of tests. A deliberate simplification of these aspects
follows in order to demystify certain concepts
surrounding psychometrics. Figure 1 addresses test
reliability and validity. A test is valid if it measures what
it claims to measure. This concept is of particular
interest to bilingual educators because of the potential
hazard surrounding tests that are constructed specific
to one single reference group. For example, a test of
auditory phonemic discrimination in English would not
be valid for students who speak little or no English.

A test may be considered reliable if the results are 
consistent, that is, if the same student under similar 
testing conditions receives the same score repeatedly. 
For example, a child who is a native English speaker 
should repeatedly receive a high score on an English 
language proficiency test. Figure 1 illustrates how a test 
may be reliable but not valid, valid but not reliable, and 
both valid and reliable. These are but a few of the 
technical aspects of test construction. For further 
discussion, see TenBrink (1974), Borg (1974), and Glass and
Stanley (1970). The necessity of the demystification of
tests is paramount. Too often teachers ignore their
intuitive judgment regarding student performance in 
deference to test results. Tests should be considered 
tools which, in the hands of a good craftsman, can be 
helpful in achieving the desired results. 

There are a variety of uses for testing which can be 
grouped into two interrelated categories: Student 
Assessment and Program Evaluation. One of the types 
of student assessment is identification and screening 
instruments targeting students for possible inclusion 
into bilingual programs. Language dominance tests, 
home-language questionnaires, teacher observation
surveys, etc., are useful for identification and screening.
The key factor in this type of assessment is the focus on
those variables which can determine program eligibility
as quickly as possible.

The next type of student assessment is diagnostic
testing, whose purpose is curriculum placement,
program placement or prescriptive teaching. Diagnostic
tests are instruments which yield analytical information
often identifying students' strengths and weaknesses.
"The emphasis of diagnostic tests is on gathering
specific information on what the child does or does not
know and why, not just how many answers the child
has correct" (Duron & Kerins, 1982, p. 49). Diagnostic
tests can be language proficiency, norm- or
criterion-referenced tests.

A final kind of student assessment instrument is used 
for exit/reclassification purposes. This type of test is 
also diagnostic in scope allowing evaluators to 
determine which students previously identified as 
being limited-English proficient would be able to
successfully compete in an all-English classroom.
Language proficiency and achievement tests are most
often used as exit/reclassification tools to assess the
listening, speaking, reading and writing skills of
students enrolled in bilingual programs.



Program evaluation occurs in response to a need to
demonstrate program efficacy to parents and 
community members, local and state decision makers 
and those individuals interested in bilingual education. 
Often program evaluation, particularly in light of Title
VII, is only designed to meet the reporting requirements
set forth by the federal government. Program evaluation 
should encompass a variety of data sources including 
interviews with bilingual program and general staff; 
questionnaires to students, staff and parents; review of 
student test results, grades, evaluations; and other 
information such as proposals, class lists, meeting
minutes, etc. 

Law (1977) in his article entitled "Evaluating Bilingual 
Programs" states that the nature of bilingual/bicultural 
education goes against evaluation using a rigorous
experimental control. He goes on to say that the
measurement problems are acute because the
population is unique linguistically and culturally. The
fact that all students who are eligible for program
participation should be served denies an experimental
design with sampling and control groups. Considering
that fact alone, it is no wonder that there is a paucity of
experimental research in the field of bilingual education.

Problems and Misuse of Instruments

The problems and misuse of instruments will be divided
into two areas: those concerns related to all formal
testing situations and those concerns idiosyncratic to
tests designed for a bilingual population. A discussion
of the most common abuses in the administration of
standardized tests is found in Duron and Kerins (1982).
These abuses include nonstandardized administration,
ignoring time limitations, incorrect scoring, and
misinterpretation of the standard error of measurement.

Standardized test administration is necessary in order
to make comparisons between students' test results
and those of the norm group. Each test provides a
technical manual containing standard instructions
which need to be carefully followed to ensure
comparability among scores. An example of
nonstandard test administration is when a bilingual
teacher translates items from a standardized vocabulary
test into the child's home language. Another error,
ignoring time limitations, can greatly influence test
results. Norms or other performance standards are
invalid when time sequence has been modified. An
excellent test can be rendered useless if it is scored
incorrectly. Simple things such as careless arithmetic
errors or incorrect reading of the norm table can
drastically change the total score of the test. Scores
should be checked and rechecked to avoid these types
of errors.

The final problem of standardized tests is related to the
standard error of measurement. This refers to "an
estimate of the range of scores that would be found for
a specified person if that person were to be tested again
and again on the same test" (Duron & Kerins, 1982). The
"confidence level" of the score depends upon the test
reliability and standard deviation. Scores on tests with
low reliability and a high standard deviation would be
questionable and should be substantiated through
further testing or other non-test data.
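
The dependence of the score band on reliability and standard deviation can be illustrated with a short sketch. It assumes the classical test theory formula, SEM = SD x sqrt(1 - reliability), which the paper itself does not state; the function names and the sample values are hypothetical and chosen only for illustration.

```python
import math


def standard_error_of_measurement(std_dev: float, reliability: float) -> float:
    """Classical test theory estimate: SEM = SD * sqrt(1 - reliability).

    Assumption: reliability is a coefficient between 0 and 1.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    if std_dev < 0:
        raise ValueError("std_dev must be non-negative")
    return std_dev * math.sqrt(1.0 - reliability)


def score_band(observed: float, std_dev: float, reliability: float, z: float = 1.0):
    """Return (low, high): the observed score plus or minus z standard errors."""
    sem = standard_error_of_measurement(std_dev, reliability)
    return (observed - z * sem, observed + z * sem)


if __name__ == "__main__":
    # Hypothetical test: standard deviation of 10 points.
    for r in (0.95, 0.80, 0.60):
        low, high = score_band(observed=50, std_dev=10, reliability=r)
        print(f"reliability={r:.2f}: band {low:.1f} to {high:.1f}")
```

Under these assumptions the band widens as reliability falls or as the standard deviation grows, which is why scores from such tests call for corroboration from other sources.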

The problems and misuse of instruments designed for 
bilingual populations are acute. In the final report on the 
"Development of Entry/Exit Criteria and Associated 
Procedures for Bilingual Education" (1981), the authors 
are critical in their discussion of language proficiency
instruments. Reporting on the discouraging experiences
of teachers and other test users, they state that,
"generally, users felt that teacher judgment was more
likely to be a valid measure of both language
proficiency and capability of succeeding in an
all-English-medium classroom than any test they had
been using" (pp. 2-10).



A problem exists in that most language proficiency
tests deal with oral language only. While linguists
disagree on how language should be assessed, it is
generally agreed that the instrumentation should
consist of various components including receptive and
expressive language. There is a need for global
instruments that address the child's listening, speaking,
reading and writing skills. Most language proficiency
tests currently on the market are designed to measure
receptive language. There are few that are diagnostic
in scope and even fewer diagnostic/prescriptive tests
that would be useful to classroom teachers in
modifying their instructional strategies.

Because the field of bilingual education is relatively
new, many instruments were poorly designed or
constructed in order to capitalize on an emerging
market. Normative data are lacking or poorly
documented, and often test reliability is not established.
These factors considered, it is still encouraging to note
that great strides have been made in the area of
language assessment as awareness and psychometric
sophistication are continually augmented. As more and
more language groups are represented, however,
additional tests need to be developed to assess
students' native-language skills and to compare their
relative proficiency between languages.

The reality that tests are not perfect measures of
student achievement must frame any discussion of
their use and misuse. As such, tests can be used
constructively if they are considered as aids or tools to
be interpreted carefully. The end results of a good
testing program will be within reach if the following
occurs:

(1) Testing needs and purposes are specified.

(2) School personnel are sensitized to interpret
test results.

(3) Local norms are developed.

(4) Tests are used diagnostically.

(5) The reporting plan is specified in advance.

The importance of specification of testing needs and 
purposes was articulated in the first part of this paper.
As stated, formulation of the referral question can be
facilitated if the evaluator and school personnel work
together. At this time, problems and shortcomings
surrounding tests can be mutually considered. Test 
scores too often become "magic numbers" to teachers 
who do not look beyond the score and fail to interpret 
the results. 

Essential to a good testing program is development of 
local norms. The following section on developing local 
norms is reprinted from the Handbook on Special
Education (1982) with permission of the Illinois State
Board of Education.

How to Establish Local Norms

The first step in establishing local norms is to identify
the data base which will be used in the process. The
data base might be: (1) a set of scores of all target
students attained during the current academic year; (2)
scores of students participating in the program during
the past years; or (3) scores of students currently
participating in the program, as well as those of
students who will participate in the program in the
future. In the latter case, the test is actually re-normed
each year by including additional data. Changes in the
norms might be moderate, with no more than five years
of data collection needed to establish stable norms.

A decision about which of the three methods to use
depends on the particular situation of the local district.
A limited number of students participating in the
program during the current year may weaken the
validity of the norms used through the first method. The
second method may be used only if the same
instrument has been used in the past. Also, changes in
curriculum or redistricting might not make this option
feasible. The third method requires more time and work
than the others.

Once the data base has been identified, raw scores are
converted into metrics useful and necessary for
making comparisons among the scores. Percentiles,
stanines or grade equivalents can be established with
data from each age or grade level.

Percentiles

Percentiles may be used to compare the performance of
individual students with the performance of other
individuals or a group of students. Use of percentiles
helps answer the often-asked question: How well is this
student doing in comparison with other students?
Performance of any group of students may also be
compared to the norms by calculating the median (the
raw score at the 50th percentile) of the group and
determining the relationship between the raw score and
its correspondence to the 50th percentile on the
established norms.



Care must be taken when using percentile ranks
because a scale of equal intervals is not obtained. For
example, a percentile rank of 50 cannot be assumed to
be twice as good as a percentile rank of 25, and the
difference between percentiles of 20 and 10 (20-10 =
10) cannot be assumed to be equal to a difference
between percentiles of 50 and 40 (50-40 = 10).
Therefore, percentile ranks cannot be used to compute
means, standard deviations, correlation coefficients or
other statistics for which equal interval scales are
required.
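
As a rough illustration of how percentile ranks might be derived from a local data base, the sketch below counts the scores falling below a given raw score and credits half of the tied scores. That tie-handling rule is one common convention and is an assumption here, not something the Handbook specifies; the function names and scores are hypothetical.

```python
from bisect import bisect_left, bisect_right
from statistics import median


def percentile_rank(raw_score, norm_scores):
    """Percent of local norm scores below raw_score, counting half of any ties.

    Assumption: the "half of ties" convention; other definitions exist.
    """
    ordered = sorted(norm_scores)
    if not ordered:
        raise ValueError("norm_scores must not be empty")
    below = bisect_left(ordered, raw_score)           # scores strictly lower
    equal = bisect_right(ordered, raw_score) - below  # scores tied with raw_score
    return 100.0 * (below + 0.5 * equal) / len(ordered)


def group_median(group_scores):
    """Median raw score of a group, i.e. the raw score at the 50th percentile."""
    return median(group_scores)


if __name__ == "__main__":
    # Hypothetical local data base of raw scores for one grade level.
    local_scores = [10, 20, 20, 30, 40]
    print(percentile_rank(20, local_scores))
    # A group is compared with the norms through its median raw score.
    classroom = [20, 30, 30]
    print(percentile_rank(group_median(classroom), local_scores))
```

The result is a rank, not an interval measure, so the caution above still applies: ranks computed this way should not be averaged.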

Stanines 

After attaining percentile ranks for each score, it is
possible to convert percentiles to stanines, which are
standard scores consisting of nine possible values from
one (low) to nine (high). For each of the nine possible
stanine scores, the percentage of scores is distributed
as follows:

Stanine                 1    2    3    4    5    6    7    8    9
Percentage of Scores    4    7   12   17   20   17   12    7    4

Therefore, the lowest 4% of the scores are assigned to
stanine 1; the next 7% of the scores are assigned to
stanine 2, etc. While stanines provide a relatively simple
way of explaining test results to parents or individuals
not sophisticated about testing, stanine values do not
allow for precise interpretation of a particular score.
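
The conversion from percentile rank to stanine can be sketched as a simple lookup. The sketch assumes the conventional stanine percentages (4, 7, 12, 17, 20, 17, 12, 7, 4), which agree with the 4% and 7% figures given above; how a rank falling exactly on a boundary is treated is an assumption, and the function name is hypothetical.

```python
from bisect import bisect_right

# Conventional percentage of scores in stanines 1 through 9.
STANINE_PERCENTAGES = [4, 7, 12, 17, 20, 17, 12, 7, 4]

# Cumulative upper bounds separating stanines 1..8 from the next stanine:
# 4, 11, 23, 40, 60, 77, 89, 96.
CUTOFFS = []
running_total = 0
for pct in STANINE_PERCENTAGES[:-1]:
    running_total += pct
    CUTOFFS.append(running_total)


def stanine_from_percentile(percentile_rank: float) -> int:
    """Map a percentile rank (0-100) to a stanine (1-9).

    Assumption: a rank exactly on a cutoff is placed in the higher stanine.
    """
    if not 0 <= percentile_rank <= 100:
        raise ValueError("percentile_rank must be between 0 and 100")
    return bisect_right(CUTOFFS, percentile_rank) + 1


if __name__ == "__main__":
    for pr in (2, 10, 50, 90, 99):
        print(pr, stanine_from_percentile(pr))
```

Because each stanine covers a band of percentile ranks, two students with quite different ranks (for example 41 and 59) receive the same stanine, which is the loss of precision noted above.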

Grade Equivalents

Grade-equivalent scores represent an estimated
average (median) performance level achieved by a
sample of students if they were tested each month of
the school year. For purposes of computing grade
equivalents, the school year is divided into 10 months.
Grade equivalents are expressed in terms of a year and
a decimal fraction of a year. For example, if a median
raw score of a sample of third graders in the 5th month
of the school year (grade 3.5) is 40, then the grade
equivalent for a raw score of 40 is 3.5 (3rd grade, 5th
month).
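
The grade-equivalent example can be expressed as a small lookup over median raw scores. In the sketch below the table of medians is hypothetical apart from the 3.5/40 pair taken from the text, and the use of linear interpolation between tabulated medians (with no extrapolation beyond the table) is an assumption for illustration, not a procedure described in the Handbook.

```python
def grade_equivalent(raw_score, medians_by_grade):
    """Estimate a grade equivalent from median raw scores by grade placement.

    medians_by_grade maps a grade placement (e.g. 3.5 = 3rd grade, 5th month)
    to the median raw score of students tested at that point.
    Assumptions: linear interpolation between adjacent medians; scores outside
    the tabulated range are clamped to the nearest tabulated grade placement.
    """
    if not medians_by_grade:
        raise ValueError("medians_by_grade must not be empty")
    points = sorted((score, grade) for grade, score in medians_by_grade.items())
    if raw_score <= points[0][0]:
        return points[0][1]
    if raw_score >= points[-1][0]:
        return points[-1][1]
    for (lo_score, lo_grade), (hi_score, hi_grade) in zip(points, points[1:]):
        if lo_score <= raw_score <= hi_score:
            if hi_score == lo_score:
                return lo_grade
            fraction = (raw_score - lo_score) / (hi_score - lo_score)
            return round(lo_grade + fraction * (hi_grade - lo_grade), 1)
    raise AssertionError("unreachable: raw_score lies within the table range")


if __name__ == "__main__":
    # Hypothetical local medians; only the 3.5 -> 40 pair comes from the text.
    medians = {3.1: 32, 3.5: 40, 3.9: 47}
    print(grade_equivalent(40, medians))
    print(grade_equivalent(36, medians))
```

The clamping at the ends of the table reflects a deliberate choice in this sketch: a grade equivalent far outside the grades actually tested would be an extrapolation, not an observed median.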




Grade equivalents should be interpreted with
considerable caution. Grade equivalents are useful as
rough indicators of the performance of
students at particular grade levels. For example, a
grade-equivalent score of 8.6 obtained by a sixth grade
student on a math test does not necessarily mean that
this student knows math at the 8th or 9th grade level.
The 8.6 grade equivalent means the sixth grade student
performed as well on the test as a "typical" student in
grade 8.6 would perform on the same test.




Another problem in interpreting grade-equivalent
scores is that an equal interval scale is not obtained. For
example, the difference between grade-equivalent
scores of 8.6 and 7.6 is not equal to the difference
between grade-equivalent scores of 3.6 and 2.6. For this
reason, grade-equivalent scores cannot be averaged. It
should be emphasized that interpretation of test scores
using grade equivalents is useful if the user remembers
that a grade-equivalent score simply indicates that a
student performs as well as the typical student at the
particular grade level indicated by the grade-equivalent
score.



Diagnostic Use of Tests 

A good testing program should include the diagnostic
use of tests. For example, certain language proficiency
tests contain various subtests that can be compared to
analyze the child's strengths and weaknesses. This
information can be useful in structuring instruction
based on pre-established objectives. In addition, if a
language assessment instrument is given in both
languages, the child's relative language proficiency can
be estimated so that decisions are facilitated regarding
the language of instruction. Finally, test-item analysis
on tests like the Boehm Test of Basic Concepts can
address specific conceptual deficiencies in the child.
Categories or clusters of items can be identified in
terms of strengths and weaknesses for each child.

The evaluator must ask him/herself questions regarding 
the diagnostic use of tests. These questions deal with 1)
pattern interaction variables (Is there a pattern to the
student's responses? Is there evidence that the errors
occurring are particular to the home language? To
English? To both languages?); 2) student comfort-level
variables (Is this the level of performance that would be
expected from this child? If not, has rapport been
established appropriately?); and 3) linguistic/cultural 
variables (Do the items that are missed reflect linguistic 
bias? Cultural bias?). 

The final consideration of a good testing program to be 
discussed in this paper is the reporting plan. Important 
to consider is how test results are going to be 
disseminated and to whom. Audiences such as
teachers, parents, the school board, etc., are possible 
recipients. The kinds of reports that will result from data 
aggregation should be planned and decided upon. 
Details such as the type of graphics and narratives
should be considered. Figure 2 represents a form that
can be adapted for local test coordinators, teachers or 
evaluators. 

In conclusion, knowledge of the use and misuse of tests 
is something that enlightened educators must address.
Familiarity with sources of measurement bias, which
exists in the testing situation itself, is necessary. Bernal
(1975) contends that lower test scores for bilingual
children result from those biases mentioned in this
paper. He recommends the following procedures for
improving test results:

1. Organizing students into small groups for
testing and warmup;

2. Matching the examiner with the student for
ethnicity;

3. Having the examiner spend a few minutes with
the student speaking the home language;

4. Preparing similar items for the student to
practice, giving him/her an opportunity to
articulate why they answered in certain ways
and giving them feedback on these items (p.
51).

Bernal's procedures and other testing considerations 
discussed in this paper should not be thought of as
stacking the deck in favor of bilingual students.
Conversely, these procedures should take educators
one step closer to describing the child's realistic
achievement level, often wrongly reflected by
inadequate or misinterpreted test scores. Keeping this
in mind, the assessment instruments can be useful tools
in the hands of a thoughtful, astute evaluator.

HOW CAN SECOND-LANGUAGE ASSESSMENT
CAPTURE COMMUNICATIVE COMPETENCE?

Communicative competence is a multidimensional
construct with broad social, cultural, and linguistic 
parameters. Hymes (1971) first recognized the effect of
context on cognition, noting that specific social
situations dictate the selection of grammatical
structures in language production. Within the last
decade, other second language theorists have
expanded this concept to include the knowledge of the
psychological and social customs of one's community
(Weininger, 1978), expression of interlanguage
(Selinker, 1974) or approximative systems (Nemser,
1974), and pragmatics (Chomsky, 1972; Oller, 1978).

If oral language proficiency is a manifestation of 
linguistic competence, then functional language use 
reflects communicative competence. Whereas 
linguistic competence is equated with the mastery of 
the phonological, morphological, and syntactic systems 
of a language, communicative competence assigns
priority to semantics. The emphasis on meaning is not
restricted, however, to the relationship of lexemes 
within an utterance; it encompasses all the 
supralinguistic features of discourse. Therefore, 
communicative competence can be considered an 
added dimension of linguistic competence; not an 
autonomous and distinct entity, but an integrated and 
homogenized expression of language in cultural 
contact. The adaptation of linguistic competence to the 
total informational input of the situation is the essence 
of communicative competence (Savignon, 1972). The
critical adjunct of context involves all the macroenvironmental
factors that affect the communication act (Burt,
Dulay, and Krashen, 1982).

Second-language assessment of communicative 
competence should ideally reflect: 1) a theoretic
framework, 2) a functional language system, and 3) a
model of bilingual education (Erickson, 1981). The field
of linguistics impacts second-language methodology
which, in turn, influences test construction (Upshur,
1973). The linear trend, however, moves in a step-like
progression. Although Carroll proposed the idea of
integrative tests as early as 1952, second-language
instructors, for the most part, have remained faithful to
structuralistic teaching and testing techniques. Even in
today's English as a Second Language (ESL) and
Bilingual Education instructional settings, discrete-point
tests are the most prominent (Day, 1981). If the
development of communication skills is indeed a
programmatic goal of ESL and Bilingual Education, then
language instruments must be constructed that can
validate this claim.

The integrative testing movement is directed towards
this end. It is unified in its holistic view towards
language and its attempt to evaluate the use of natural
language in social/cultural interaction. It is divergent in
its methods of measurement. Three methodological
orientations have emerged within the last decade: 

1. Direct: the replication or use of a naturalistic
setting as a backdrop to language interaction
as seen in the Ilyin Oral Interview (1976), the
Savignon Communicative Competence Tests
(1972), and the Foreign Service Institute Oral
Interview.

2. Indirect: the use of simulated or quasi-realistic
activities to promote language
production as illustrated by the cloze
procedure (Taylor, 1953) and the noise test
(Spolsky, 1968).

3. Mixed: the use of spontaneous language
sampling techniques coupled with a discrete-point
scoring system as evidenced in the Basic
Inventory of Natural Language (Herbert, 1977)
and the Bilingual Syntax Measure (Burt, Dulay,
and Hernandez-Chavez, 1976). McCollum and
Day (1981) refer to this category as quasi-integrative.

Noa, Russell, and Silverman (1976) depict three
domains of language assessment: the social, the
linguistic, and the communicative. Each facet of a
representational cube is quartered so that communication
skills reflect listening, speaking, reading, and
writing; linguistic structures denote phonology,
vocabulary, syntax, and semantics; and social settings
represent the home, school, peers, and community. An
ideal model of communicative competence should
embrace all these defined areas; an ideal language
assessment instrument should explain the model. 

The global nature of communicative competence
makes language testing very cumbersome. No one
measure can effectively capture all the intricacies of
language, nor should it try. Absent is a filtering
mechanism that retains those elements critical to
successful language learning. Language assessment
should be based on specific, predesignated needs of the
population. Communicative competence testing should
demonstrate a student's ability to use language in a
variety of commonplace settings (Morrow, 1979). The
remaining section of this paper will be devoted to how
an Illinois school district has faced this issue.

The Measure of English Communicative and Conceptual
Achievement (MECCA) was developed to provide
ESL and Bilingual Education administrators and
practitioners with reliable data for instructional decision
making. It is a comprehensive assessment system
designed for the K-12 Limited-English-Proficient (LEP)
student. Communicative competence is one of four
sections; the composite score yields a profile of English
competencies.

For the sake of practicality and feasibility, the 
perspective of communicative competence had to be 
narrowed. Considering the social factors affecting 
second-language acquisition, it was decided that input
from the home, peers, and community could be 
obtained through parent and student interviews. Thus, 
the sociolinguistic variables related to language usage 
have come to be identified under three major headings: 
student information, parent information, and interaction 
of student with others. A code sheet was devised to
systematize the process of data collection. 

The focus for assessing communicative competence 
thereby became the school environment. More 
specifically, it centered on collateral learning (Dewey, 
1938) or the hidden curriculum (Jackson, 1968), those 
aspects of school life that are essential for survival 
within the institution, but are outside the realm of 
academic learning. This covert curriculum includes the 
rules, norms, and values that are internalized by the 
student in the culture-bound atmosphere of school
(Apple, 1979).


Applying this notion to language learning, the hidden
curriculum can be equated with the context-embedded
situations (Cummins, 1981a, 1981b) with which a
student must cope within the school milieu. Thus, for
the native English speaker who has had no experience
outside an American classroom, communicative
competence would be cognitively undemanding; for
the LEP student, whose language and culture are not
consonant with that of the school, functional language
use in English (L2) would be cognitively demanding
(Cummins, 1981a, 1981b).

To accomplish the goal of measuring communicative 
competence, a list was generated of all the situational 
contexts a student encounters during the course of a 
typical school day. Each idea was categorized as an 
example of school procedures, classroom practices, or
interaction with school personnel. Objectives were then
formulated for a task-based performance test of
listening and speaking comprehension (see appendix). 

As almost all language tests seem to consist of
randomly selected items of nonspecified content
(Farhady, 1982), planning for content validity was
deliberately incorporated into every phase. Preparation
of a specification table was necessary to ensure the
proportionate representation of product and process
objectives (Thorndike and Hagen, 1977). A 2 x 3 matrix
was created whereby the two columns represented
receptive and expressive language, respectively, and
the three rows, the contextual categories. For each cell,
six items were constructed: two for the primary elementary
grades, two common to both the elementary and
secondary levels, and two unique to the junior high and
high school.
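
The layout of this specification table can be sketched in a few lines of Python. The sketch is only an illustration: the names CONTEXTS, MODALITIES, build_blueprint, and items_for_level are hypothetical, and the rule that a student at a given level takes the level-specific items plus the common ones is an assumption, not something stated for the MECCA.

```python
from itertools import product

# Rows (contextual categories) and columns (modalities) of the table.
CONTEXTS = [
    "school procedures",
    "classroom practices",
    "interaction with school personnel",
]
MODALITIES = ["receptive", "expressive"]

# Items per cell by grade band, as stated in the text: 2 + 2 + 2 = 6.
ITEMS_PER_BAND = {"primary": 2, "common": 2, "secondary": 2}


def build_blueprint():
    """Return {(context, modality): {band: item_count}} for every cell."""
    return {cell: dict(ITEMS_PER_BAND) for cell in product(CONTEXTS, MODALITIES)}


def items_for_level(blueprint, level):
    """Count the items seen at `level` ("primary" or "secondary").

    Assumption for this sketch: a student takes the level-specific
    items plus the items common to both levels.
    """
    return sum(cell[level] + cell["common"] for cell in blueprint.values())


blueprint = build_blueprint()
total_items = sum(sum(cell.values()) for cell in blueprint.values())

print(len(blueprint), "cells,", total_items, "items in the pool")
print(items_for_level(blueprint, "primary"), "items for a primary student")
```

Under these assumptions the pool holds 36 items across 6 cells, and a student at either level would respond to 24 of them.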

A pool of receptive and expressive language questions
was formed, revised, edited, and checked against the
original objectives. All items were then rank ordered in
a logical sequence from the most simplistic to the most 
linguistically complex. Illustrations for the receptive 
subsection were incorporated as needed. 

Second-language-acquisition research has confirmed
that a silent period devoted to one-way communication
precedes responsive oral language production (Burt,
Dulay, and Krashen, 1982). In defining a developmental
sequence for communicative competence, it is
necessary, therefore, to proceed from receptive to
expressive behavior. In the MECCA, for example, for
receptive competence, a student is required to identify
the illustration that shows "What do you use to write
with?" For the expressive counterpart, he/she is
requested to respond to this situation: "It's time to take
a math test. You take out your pencil and it has no point.
What would you ask your teacher?"

Learner verification was conducted with both LEP students
and native English speakers. The Educational Products
Information Exchange (EPIE) Institute's format for the
analysis of instructional materials was adapted to
assessment instruments. After taping and transcribing
the session, the data was transposed onto this form.
Based on the information gained during this phase, test 
items and illustrations were clarified. 

To facilitate the ease of scoring and to guide 
interpretation, a multiple-choice format was adopted as 
the scoring method. A four-point scale was developed 
that would accurately reflect a student's competence 
for the given situation:

1. no response, 

2. inappropriate or incorrect response,

3. partially acceptable or incomplete response, 

4. appropriate and meaningful response. 
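
One way to tabulate ratings on this four-point scale is sketched below in Python. The sample ratings are invented for illustration, the names SCALE and summarize are hypothetical, and averaging within each context-by-modality cell is an assumption; the text does not say how MECCA ratings are combined.

```python
from collections import defaultdict
from statistics import mean

# The four-point scale listed above.
SCALE = {
    1: "no response",
    2: "inappropriate or incorrect response",
    3: "partially acceptable or incomplete response",
    4: "appropriate and meaningful response",
}


def summarize(ratings):
    """Average the ratings within each (context, modality) cell.

    `ratings` is an iterable of (context, modality, score) tuples.
    Averaging is an assumption made for this sketch.
    """
    by_cell = defaultdict(list)
    for context, modality, score in ratings:
        if score not in SCALE:
            raise ValueError(f"score must be 1-4, got {score!r}")
        by_cell[(context, modality)].append(score)
    return {cell: mean(scores) for cell, scores in by_cell.items()}


# Invented ratings for one hypothetical student.
sample = [
    ("school procedures", "receptive", 4),
    ("school procedures", "receptive", 4),
    ("classroom practices", "expressive", 3),
    ("classroom practices", "expressive", 4),
    ("interaction with school personnel", "expressive", 1),
]

profile = summarize(sample)
for cell, average in profile.items():
    print(cell, average)
```

A per-cell summary of this kind keeps receptive and expressive performance apart, which matches the receptive-before-expressive sequence the instrument follows.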

As a varying range of potentially plausible responses
was acceptable, students could draw from their
personal linguistic and experiential repertoires in
answering the questions. The problem posed in this
item, for example, lends itself to a variety of possible
alternative solutions:

"All last week you didn't go to school. When you
come back, your teacher wants to know your
excuse. Why were you absent?"

The issue raised for discussion, how second language
assessment can capture communicative competence,
can now be approached more definitively.
Applying Erickson's (1981) criteria to the
communicative competence section of the MECCA, it
can be stated that:

1. Psycholinguistic and sociolinguistic theory
underlie the development of the instrument.

2. The hidden agenda of the schools offers a rich
array of interpersonal communication contexts
for assessing functional language usage.

3. Modification of the bilingual curriculum and
the establishment of entry/exit criteria for the
bilingual program will evolve from the pilot and
field-test findings of the composite system.

Provision for social and cultural adaptability is as 
necessary to the LEP student as the promotion of 
academic achievement. Communicative competence 
demonstrates the ability of a student to respond to a 
wide range of linguistically diverse situations. However, 
it is merely one aspect of the complex language 
development process. 

Within the school domain, second language assessment
needs to address total language usage:
communicative proficiency in conjunction with conceptual
achievement. Communicative competence alone
cannot adequately determine a student's academic 
performance in a second-language classroom. 
Comprehensive evaluation measures are requisite for 
the provision of appropriate and meaningful 
educational experiences for all LEP students. 

Appendix 

Language Contexts in the School Domain: Communicative
Objectives for a Task-Based Performance Test of
Listening and Speaking Comprehension

I. School procedures

1. evacuate the building during a fire drill

2. require an excused absence

3. ask for early dismissal

4. explain tardiness

5. relate playground (campus) rules

6. line up for entrance/exit

7. respond to schedule of in-service days and
vacations

*8. open your locker

*9. use the telephone

*10. describe your schedule

*11. argue against study hall in favor of an open
campus

II. Classroom practices

1. follow oral directions (simple to multiple)

2. sharpen a pencil

3. request supplies (paper, scissors, crayons,
ruler)

4. ask for a pass to the washroom or for a
drink of water

5. apologize to a teacher or peer

6. use polite phrases and greetings

7. paraphrase the principal's directive over
the intercom

8. explain the reason for no homework

9. recall class behavior code

*10. argue with a peer

11. salute the flag

*12. explain 'cutting class'

III. Interaction with school personnel

1. nurse: explain what's wrong (injury or
illness)

*2. counselor: react to a job interview

3. librarian or resource teacher: tell about a
lost book

4. office secretary: explain a fight or incident

5. lunchroom attendant: pay for a meal or
obtain a ticket

6. custodian: relay a message from the
teacher

7. bus driver: give your new address or the
route to a friend's

8. principal: supply requested information

*9. dean: respond to a suspension for
smoking

*10. P.E. teacher: react to taking a shower
A MODEL FOR ASSESSING COMMUNICATIVE PROFICIENCY
IN A SECOND LANGUAGE 
The field of language assessment has witnessed much
advancement in the last decade. Practitioners involved
in teaching a second language are developing an
increasingly accurate view of what constitutes
proficiency or competence in a target language. Despite
these developments, however, we lack consistency in
both the approaches taken to second language
assessment and in the actual tests that are available
through the market. One of the reasons for this
inconsistency in how second language proficiency is
assessed may well be the lack of a conceptual
framework that integrates a definition of language
proficiency with pedagogy and assessment.



The above considerations necessitate a system of
communicative proficiency assessment that takes into
account the initial steps before assessment begins and
the steps to be taken after assessment is completed. A
system of communicative proficiency assessment
should have as its basis a description of what
constitutes successful communication in a given
situation (see Figure 1). For example, what characterizes
successful language use for a teacher in a bilingual
classroom, for a student in a target-language classroom,
or for an adult in a specific work situation? Questions
such as these should form the basic premise of
communicative proficiency assessment.



Figure 1:

A Model of Communication Proficiency Assessment 
The first section of this paper is an attempt to identify
characteristics of language in use (the top box in Figure
1). This will serve as the foundation on which an
assessment system may be built.

CHARACTERISTICS OF LANGUAGE PROFICIENCY 

To begin, we need a comprehensive description of
language proficiency itself. As Brown (1980) indicates,
development of a valid assessment system first requires
a good definition of language proficiency and how it is
acquired. The latter area has received ample attention
in the field of second language research. Extensive
research in first and second language learning has
indicated that we learn language by a process of
creative construction through which we formulate
hypotheses about the language being learned and we
apply and modify those hypotheses on the basis of
interaction with the linguistic environment that
provides the context for learning (Brown, 1980;
McLaughlin, 1978). By a series of hypothesis building
and testing in the context of communicating in the
target language, second language learners are said to
pass through systematic stages, or interlanguages
(Selinker, 1972), that progressively approximate fluent
"native-speaker" language.

Although the second language learning process has
been fairly elaborately described, it is not clear what
constitutes proficiency in a language. Two major
aspects of language proficiency have been identified
and discussed in the literature. Although different labels
have been used to refer to these two aspects, the basic
notion of linguistic proficiency, on the one hand, and
sociolinguistic proficiency, on the other, emerges. Many
writers (see, for example, Canale and Swain, 1980;
Erickson, 1981) have referred to the two aspects as
form and function. Others (Bachman and Palmer, Note
1) have referred to grammatical competence and
sociolinguistic competence as two constituents of the
larger concept of communicative competence. In this
paper, I will adopt (with modifications) the terms used
by Bachman and Palmer and refer to the general
proficiency in language as communicative proficiency
and to the two components as linguistic proficiency
(form) and sociolinguistic proficiency (function).

The history of the development of a conception of
communicative proficiency has been marked by the
continuing debate between proponents of a model
which emphasizes a general factor of language
proficiency (see, for example, Oller and Hinofotis, 1980)
and those who maintain that communicative
proficiency consists of a number of distinct
components (such as Canale and Swain, 1980). While
the dichotomy between the single-underlying-factor
model and the multiple-component model may exist at
the theoretical level, one view need not preclude the
other. The separate components of communicative
proficiency may be seen as feeding into one general
underlying language ability. In fact, there seems to be
empirical evidence supporting both views: Oller
maintains that one general factor accounts for the
greatest proportion of reliable variance in language test
scores, but in addition, other researchers (Carroll, 1980;
Bachman and Palmer, Note 1) have shown that more of
the variance in language test data can be accounted for
when information from specific language traits is taken
into account.



Figure 2: 

A Model of Communicative Proficiency
THE FUNCTION OF LANGUAGE



A framework of a general communicative proficiency,
consisting of two components, is illustrated in Figure 2.
The conceptual framework presented in this paper
borrows from the Bachman and Palmer model, as well
as that of Canale and Swain. In both models, it is
hypothesized that communicative competence consists
of several components which measure distinct
underlying abilities. In an attempt to avoid any overlap
between the components of communicative proficiency
and in order to illustrate the relationship between those
components, the framework in Figure 2 has been
modified from the original presentation (Bachman and
Palmer, Note 1). The term "proficiency" is used in the
present paper instead of "competence" in order to avoid
the possibly confusing distinction between competence
and performance. Proficiency will henceforth be
used to indicate mastery as it surfaces via performance
on a given task. Secondly, the various components of
communicative proficiency are categorized into the two
major aspects, linguistic and sociolinguistic, corresponding
to Erickson's form and function, respectively.
This is a departure from Canale and Swain's model,
which includes: 1) grammatical competence (our
linguistic proficiency), 2) sociolinguistic competence
(our sociolinguistic proficiency), and 3) strategic
competence. The last component has been excluded in
the present paper as a separate component and is
rather seen as part of a more general communicative
ability. As Bachman and Palmer argue, strategic
competence comprises an ability which affects all the
components of communicative proficiency that are
actually put to use. The present conceptualization of the
two aspects of communicative proficiency also departs
from Bachman and Palmer by classifying pragmatic
ability as a component and not as one of the major traits.

Linguistic Proficiency 

This aspect of communicative proficiency, which has 
been the focus of much of the earlier writings on 
language proficiency, includes mastery of the sound 
system, the grammar, and the pragmatics of a language. 

The first component, the sound system, includes the 
mastery of the phonological system as well as the 
prosodic features of a language. The latter are important 
features of language that are frequently neglected. 
Non-fluent speech is often characterized by the use of
inappropriate intonation or stress which may have a
strong negative impact on communication. Bachman
and Palmer exclude phonology from their model,
arguing that it acts more as a channel than a
component of competence. However, second language
users clearly develop varying levels of proficiency in the
sound system. Consequently, the effectiveness of a 
message — both connotatively and denotatively — 
would vary as a function of that level of proficiency. 

The second component of linguistic proficiency is
grammar, which includes mastery of the morphology as
well as the syntax of a language. This component has
been the focus of many tests of second language
proficiency and has received ample attention in the field
of language testing. 
The third component is pragmatic competence, which
Bachman and Palmer associate with the ability to
express and comprehend messages. Included in this
component are mastery of vocabulary (hence, the
placement of the component under the linguistic
aspect) and the ability to process cohesively organized
utterances. Although this set of abilities has been
included in the linguistic component, it is nevertheless 
very closely related to the sociolinguistic component. 
This interrelation of linguistic components and the 
sociolinguistic aspects is to be emphasized and will be 
discussed later. 

The relation among the three linguistic components
also needs to be discussed. Proficiency levels in the
three components (the sound system, the grammar,
and pragmatics) are generally highly correlated.
However, as Cziko (Note 2) has pointed out, most
research in this area has used correlation coefficients
between pairs of language measures; thus, individual
performance is compared to a group as in a
norm-referenced approach. A person's proficiency
relative to meaningful "absolute" criteria has not been
measured. Thus, a non-native speaker may have
native-like proficiency in grammar, but may have a very 
low level of proficiency in the sound system of a target 
language. In fact, informal observation of many ethnic 
communities in the United States leads me to believe 
that the above description accurately fits a significant 
number of second language users. This could result in a 
high positive correlation between the two language 
measures, grammar and sound, which will not reflect in 
any way, the (possibly significant) difference between 
the absolute levels of proficiency in the two measures. 
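
The point that a correlation coefficient is blind to absolute level can be checked with a small numerical sketch in Python. The scores below are invented purely for illustration (five hypothetical learners rated from 0 to 100 against a fixed criterion), and the helper name pearson is an assumption of this sketch rather than part of any cited study.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Invented criterion-referenced scores (0-100) for five learners.
grammar = [82, 85, 88, 91, 94]  # near native-like on every learner
sound = [32, 35, 38, 41, 44]    # same rank order, 50 points lower

r = pearson(grammar, sound)
gap = mean(g - s for g, s in zip(grammar, sound))

print("correlation:", round(r, 3))
print("mean gap between grammar and sound:", gap)
```

In this invented data the two measures are perfectly correlated because every learner keeps the same relative position on both, yet each learner's sound-system score sits 50 points below the grammar score. Only a comparison against absolute criteria reveals that gap.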

Linguistic proficiency has received most of the
attention in the area of language assessment. Most
language tests on the market focus solely on
components of linguistic proficiency, with disregard to
the other, the sociolinguistic aspect of communicative
proficiency. However, even within this restricted arena,
the method of assessment varies dramatically. Two key
issues in language assessment are whether language
should be tested by discrete-point versus integrative
tests and, secondly, whether language should be tested
directly versus indirectly (see Cohen, 1980).

Concerning the first issue, most marketed tests have
traditionally taken a discrete-point approach, where
language (or, in this case, linguistic proficiency) is
broken down into its component parts and each one of
those parts is tested separately. Thus, the ability to
process subject-verb plural agreement (the sheep is
jumping vs. the sheep are jumping) or the ability to
discriminate between minimal pairs (ship-sheep) is
tested. It has been suggested (Brown, 1980; Erickson,
1981) that the rationale for this approach is based on
structuralist and behavioral psychological theories that
view language as the sum of its parts. The
discrete-point approach has met with criticism,
particularly in light of the view that language is a
synergistic phenomenon and that application of
linguistic skill requires integration for its pragmatic use
(Brown, 1980). In view of the latter formulations, we
need to refocus methods of assessment from the
exclusive sampling of discrete points within language
to the inclusion of integration of different skills. We are
witnessing such a change in current testing practices,
such that integrative ways of assessing linguistic
proficiency are being added to an increasing number of
tests (for example, the Woodcock Language Proficiency
Battery).

The second issue in language assessment method has
concerned the dichotomy between the direct and
indirect ways of testing linguistic proficiency. A direct
test of linguistic proficiency samples the actual
behavior being evaluated, while an indirect test
samples behavior through a task that is different from a
normal language-using task (Stoltz and Bruck, 1976).
For example, if a student's ability to use the past tense
in speech is to be assessed, a direct way of assessment
would be to ask the student to give a talk in front of the
class; an indirect way would be to give him/her a
multiple-choice grammar test with specific items on the
past tense. Traditionally, tests have taken an indirect
approach to assessing oral language proficiency,
usually in multiple-choice formats. Possibly, this is due
to "paper-and-pencil" tests being easier to control,
administer, and validate. However, there may be a basic
problem with assessing oral language through indirect
means since oral language is a chameleon-like
phenomenon, in constant change (Tarone, Note 3).
Different types of speech elicitation methods produce
different types of errors in second language users
(LoCoco, 1976). For example, translation into the
second language leads to a higher frequency of errors of
interference from the native language than does
spontaneous speech. There is an increasing tendency
for language testers, then, to assess oral language
proficiency in as direct a method as possible. Cohen
(1980) has observed that tests lie on two continua, the
first from the most discrete-point to the most
integrative and the second from the most indirect to the
most direct. Research suggests that test makers should
adopt the latter extreme of each continuum, that is,
the integrative and the direct.

Sociolinguistic Proficiency 

This aspect of communicative proficiency concerns the 
social aspect of language use. Use of language, or 
communication, is very strongly affected by social 
constraints. Language is always used in a context, and 
consequently, oral language must be assessed within 
its appropriate contextual framework. Hence, this 
aspect of communication should be included within a 
model of language assessment. 

There is ample evidence in the literature that language
use varies as a function of various social and contextual
factors. Ferguson (1959) identified twelve different
speech functions related to the setting of the
interaction, the topic under discussion, and the
participants involved in the communication act.
Fishman (1965) discussed the domains in which the
form of language use varies. Bilingual persons often
report an inability to convey messages effectively in
certain topics (for example, discussing psychology vs.
discussing the weather), or to certain people (talking to
one's parents vs. talking to one's colleagues) even in a
language in which they are generally very fluent.
Morrow (1977) defined receptive sociolinguistic
proficiency as being the ability to gauge speech in
terms of:

1) the setting to which an utterance is appropriate,

2) the topic being discussed,

3) the function of the utterance,

4) the attitudes conveyed by the speaker,

5) the presuppositions behind the utterance,

6) the role or the status that the speaker is
adopting,

7) the level of formality of the utterance,

8) the mood of the speaker.

The same skills listed above for the reception of oral
messages should also apply to the production of oral
expressions. Basically, we are concerned with the use of
the contextually appropriate register or code. These
skills are probably very much affected by cultural
constraints or cultural norms of acceptable behavior.
The register that is used in Arabic, for example, to
address the elderly is slightly more formal than the one
used with younger peers. This reflects the deference
that is accorded to the elderly in the Arabic culture
(Berger, 1962). Bachman and Palmer (Note 1) include in
the category of sociolinguistic competence the
following sub-traits: the ability to distinguish registers,
mastery of nonliteral figurative language, and use of
relevant cultural allusions.

In sum, oral communicative proficiency consists of a
conglomeration of skills, both linguistic and
sociolinguistic in nature. These two components of
communication have been discussed separately, and
the skills that constitute each type of proficiency have
also been singularly identified. Nevertheless, the
interdependency of these separate skills cannot be
sufficiently stressed. The potency of a message
depends on the interlocutor's linguistic proficiency,
which itself would determine and, in turn, be
determined by the interlocutor's sociolinguistic
proficiency. For example, a person's pragmatic skills,
the ability to use target language vocabulary (a
linguistic skill), would constrain the appropriateness of
the register used (a sociolinguistic skill) and, in turn,
knowledge of the norms for appropriate language use in
a specific social context (a sociolinguistic skill) would
determine the choice of one set of words over another
(vocabulary, a linguistic skill). The effectiveness of
communication, then, may have to be judged relative to
the function that language is supposed to serve in a
particular context. An analogy comes to mind: if you are
asked to leave on a long trip with only three items as
luggage, where you are to go and what you are to do
there would determine to a large extent what those
items are to be. A fishing trip would require a very
different set of clothing than, for example, a formal
business trip. We need to know how the target
language is to be used in order to be able, first, to teach
it and, second, to assess proficiency in it. Thus, we
return to the question asked at the beginning of this
paper: what linguistic and sociolinguistic skills are
needed in order for a person to survive, be accepted,
and perhaps excel in a given situation?

SETTING STANDARDS FOR LEVELS OF PROFICIENCY

The question remains as to what characterizes
successful use of a language. When a person does not
possess native-like fluency, as may be the case for most
second-language users, the need arises to identify
those skills that are most important for successful
communication; that is, we need to know how
proficient is proficient enough. Further, since verbal
interaction takes many forms and shapes, we need to
know what it is that allows a second-language user to
survive in a specific verbal interaction or a specific
linguistic environment. The following questions seem
appropriate.

1) How proficient must a limited-English-proficient
student be to survive in an
all-English classroom?

2) How proficient must a bilingual teacher be in 
order to teach in a classroom where the target 
language is used as the medium of instruction? 

Questions similar to the above may be asked for the 
whole gamut of language environments and 
communication acts in which language users engage. 
For instance, how proficient must a limited-English-proficient
person be to work at a hospital where English
is the common language of communication, or to 
become an effective salesperson? 

The answers to the above and other similar questions 
should then be used as guidelines for setting standards 
for minimum levels of proficiency in a language. This is 
the second step in establishing a language assessment 
system (right-hand-side box in Figure 1). 

When standards or criteria for proficiency in a language 
are being established, the issue of levels or degrees of 
"success" needs to be addressed. At a minimal level, 
successful language use may refer to simply getting a 
message or an idea across. At a more exigent level, 
successful language use, in addition to getting the 
message across, may refer to being accepted or 
positively regarded by the interlocutor(s); this becomes
especially pertinent for a non-native speaker from a
linguistic minority interacting with a native speaker
from the majority population.

To arrive at standards for a minimal level of proficiency, 
it would be helpful to examine the research that is
currently being carried out on language as it is used in a
specific context. 



Standards for a Minimal Level of Proficiency 

Although research is currently being done on language
use in different situations, very few such studies have
assessment as their stated goal. Research in
second-language learning and use and research in
second-language assessment have been developing
independently from one another. Classroom-centered
research is a good example of this autonomy: the
questions being asked in that area of research
(regarding the type of language that is used in the
classroom or the type of teaching strategies used by
teachers) have not been concerned with setting
standards for survival in those particular environments,
although, as Allwright (Note 4) has pointed out, the view
that the classroom is a setting for language acquisition
has brought the field closer to second-language
acquisition studies. The research findings in those areas
can still be useful, and it would behoove researchers in
assessment to examine that body of literature to obtain
descriptions of language used by different types of
interlocutors in different language environments. The
following areas of research may provide useful sources
of information for researchers in assessment.

1) For students who have a limited proficiency in
the language of instruction, the question to be
asked regards the type of language that is used
in the classroom by the teacher as well as
native-speaker peers. It has been suggested
(Cummins, 1980) that for limited-English-proficient
students, some type of formal
academic language skills (what Cummins
has called Cognitive Academic Language
Proficiency, or CALP) is needed in order for a
minority child with limited proficiency in the
majority language to survive in a majority
classroom setting. CALP skills have been
defined as being context-free or context-reduced.
Empirical work is needed to identify,
test, and further characterize what those
particular skills might be. Standardized tests,
normed on native speakers of English, are not
suitable for limited-English-proficient students
because we do not know for a fact that a
non-native speaker needs to be as proficient as
a native speaker in order to be an effective
language user within the classroom. We return
to the initial question set forth in this paper:
what is the smallest amount of linguistic and
sociolinguistic baggage that a child requires for
a given "trip"? In effect, what type of language
is used among peers in the classroom; what
kind of student-teacher and teacher-student
interactions take place; beyond the classroom,
what language does a child need in the
playground? Classroom-centered research provides
a first step in answering these questions.
Cathcart (Note 5), Lightbown (Note 6), and
Schinke (Note 7) have examined the language
that students are exposed to in a classroom
where a language other than their dominant
one is used.

2) For bilingual teachers who will be teaching in a
nondominant language, it is essential to know
what type of language and what type of
teaching strategies are needed to convey
information to students at different grade
levels. The research by Allwright (1980),
Cathcart and Olsen (1976), Chaudron (1977),
Hamayan and Tucker (1980), and Johnson
(Note 8) should be helpful in delineating those
skills (for a review of this literature, see Long,
1980).

3) For adults learning a second language, we
must know the specific language skills that are
needed to function in different language
contexts. For the non-native speaker who will
be attending a university in a nondominant
language, the question is: what types of
language skills are needed in order for a
student to survive in a university setting where
his/her nondominant language is used as the
medium of instruction? The work of Allwright
(1980), Bailey (Note 9), Gaies (1977), and Long
and Sato (in press) would help provide answers
to this question. For the non-native speaker
who, for example, will be working in a hospital,
the question regards the types of language
skills that are needed to carry out tasks
involved in the job. Educators involved in
English for Special Purposes curricula and in
notional syllabi took this approach many years
before language assessment researchers got
involved in it. (However, as Widdowson (1978)
notes, while the notional syllabus claims to
develop communicative competence within
the design of the syllabus, this goal is not
always achieved; the notional syllabus still
takes the approach that language is a
conglomeration of units.) It would be beneficial
to those of us involved in language assessment
to examine those syllabi and the tests that
accompany them.

SOCIAL STANDARDS FOR LEVELS OF PROFICIENCY

Social aspects of language use are crucial for a
comprehensive system of communicative proficiency
assessment. Lambert and others (see, for example,
Gardner and Lambert, 1972) have pointed out the
importance of language as a variable that interfaces
with attitudes: attitudes are crucial in language learning
processes; and language use, in turn, is important in the
formation of attitudes. To date, there is ample evidence
concerning the facilitative role that positive attitudes
play in second-language learning (Tucker, Hamayan
and Genesee, 1976). More recently, there is a growing
body of evidence for speech, or language use, as a
strong marker for group identity and for the formation of
value judgments. The way you speak marks you as
belonging to one ethnic group or another, as coming
from one social class or another, and as having one
educational background or another (Hymes, 1972).
There is evidence (Giles, Bourhis and Taylor, 1977) that
people use speech to form attitudes toward others and
they depend on speech as a marker to such an extent
that it may even supersede information regarding the
other person's race (McKirnan, Smith and Hamayan,
Note 10).



These findings are important to take into consideration
both in teaching the various aspects of language that
are deemed necessary for successful communication in
a specific situation and in assessing the readiness of a
non-native speaker to plunge into those situations
where the target language is the medium of
communication. Those social issues must then be taken
into account in establishing the criteria for successful
use of a language. In this case, success may be defined
in terms of getting a message across and in terms of
being accepted by members of a group who are
proficient in that target language. In a recent study,
Bailey (Note 9) found that non-native-speaking
university teaching assistants' accents, to a large
extent, accounted for native-speaking students'
evaluations of their effectiveness as teaching
assistants. It is important, then, to train our language
students to sound native enough and prestigious
enough, whether that be linguistically (sound,
grammar, pragmatics) or sociolinguistically, so that
not only does the message get across, but the speaker
is actually accepted and evaluated positively by others
in his/her immediate environment.



SUMMARY 



Thus far, language assessment procedures fall short of a
comprehensive model that takes into account the
nature of communicative proficiency, including both
linguistic and sociolinguistic aspects. There seems to
be a need to describe successful communication in
different language environments and to set up criteria
for needed linguistic and sociolinguistic skills,
considering the social implications of speech that falls
below the proficiency levels of a native speaker.
THE IDEA ORAL LANGUAGE PROFICIENCY TEST: A CRITICAL REVIEW



Pamela A. McCollum
The University of Texas at Austin 



The purpose of this article is to analyze the IDEA Oral
Language Proficiency Test (IPT) which is published by
Ballard and Tighe, Incorporated of Whittier, California.
The IPT is part of the IDEA program which consists of
the IDEA Oral Language Proficiency Test and the IDEA
Oral Language Management Program of Developmental
English Activities. This program is an innovation in that
it is the first language proficiency test which prescribes
a specific language development program for
Limited-English-Proficient (LEP) students based on its
results.

Results from the IDEA Oral Language Proficiency Test
are used to place LEP students into a language
development program which is purported to
correspond to their appropriate level of language
proficiency and which will provide them with the skills
which are necessary for reading. The IDEA program has
been state-adopted in California and Texas and is
widely used in Colorado, New York, Arizona, Oregon,
Washington and Hawaii. While the appeal of such a
self-contained program to teachers and administrators
is obvious, close examination of the program reveals
that both its claims and the theoretical base upon
which it is founded must call its use as an oral language
proficiency measure for limited-English-proficient
students into question.

Development

The authors state that the IDEA Oral Language
Management Program was developed in response to a
need for a developmentally sequenced approach to oral
language skills for first and second language learners
which would prepare them for reading. As a result,
Wanda Ballard and Phyllis Tighe, two experienced
California classroom teachers, developed a sequential
program of English activities which would give
students a strong base in oral language prior to reading.
Ballard and Tighe found that L2 learners as well as





providing instruction in the areas of vocabulary,
comprehension, syntax, and verbal expression at seven
language skill levels."

The IPT was developed after the IDEA Language
Management Program was implemented. The authors,
Wanda Ballard, Phyllis Tighe and Dr. Enrique F. Dalton,
who acted as the technical advisor, state that the
purpose of the IPT is "to determine the level of oral
language mastery as it relates to the IDEA Oral
Language Management Program and to accepted levels
of NEP/LEP/FEP (non-English-proficient/limited-English-proficient/
and fully English-proficient)
classifications" (Dalton, 1980, p. 5). The IPT is available
in Spanish, Portuguese and English and is to be used in
kindergarten through eighth grade.

Test Format



The English version of the test has two forms, A and B,
which consist of eighty-three items which are said to
measure proficiency in the areas of vocabulary, syntax,
comprehension and verbal expression. Thirty-five of the
test items require students to respond to questions
related to cartoon drawings in the test booklet. The
remainder of the items require students to perform such
tasks as responding to commands to indicate
comprehension, discriminating between minimal pairs,
retelling a short story, describing an object and
selecting the main ideas from a short passage which is
read to them. The Spanish version of the IPT is not a
translation equivalent of the English version and is only
available in one form.



The IPT divides language proficiency into seven
developmental levels, A-F and M. The content
knowledge and language skills which students are to
possess at each level are specified and correspond to
the material in the IDEA Oral Language Management
Program (see Table 1). The test items at each
level test students' ability in
vocabulary, comprehension, syntax and verbal
expression.

The IPT differs from other language proficiency tests in
that it divides language proficiency into seven
developmental levels. It also differs in that it posits
verbal expression as one of the dimensions of language
proficiency. The authors never actually state which
language skills comprise verbal expression, although
articulation skills are said to be included in it. An
examination of the types of items which are included in
the verbal expression category in the Rationale of Test
Items (see Table 2) shows that this category contains a
number of different types of items. They include
sentence repetition, the production of the future and
past tense, discrimination of minimal pairs, descriptive
skills (where one is required to describe a pencil), and
the identification and retelling of main ideas from a
spoken paragraph. It is not clear why these items were
not placed in one of the three categories (vocabulary,
comprehension or syntax) or how the IPT scoring system
reflects one's articulation skills.

The choice and assignment of test items by skill area
and developmental level at times seems illogical (see
Table 2). Test items across a skill area do not
consistently reflect a progression from less to more
complex in terms of the type of tasks which students
are asked to perform. For example, in the verbal
expression skill area, students have to perform sentence
repetition tasks at Level D, while they are required to
discriminate between minimal pairs at the following
two higher levels of language proficiency. At Level F,
which is the highest level which contains test items
(Level M has no items but indicates a student
responded correctly to 80% of the items on Levels B-F),
in the verbal expression category students are asked to
discriminate between minimal pairs and select and
retell the main ideas from an orally read paragraph.



Table 1

Summary of the IDEA Proficiency Test Content

A LEVEL A student knows less than 50% of skills listed in LEVEL B.

A LEVEL B student can:

1. tell his name and age.
2. identify family and common school personnel, classroom objects, basic body parts, common pets and fruits.
3. use present tense verb "to be".
4. use regular plurals.
5. answer simple "yes/no" questions appropriately.
6. follow simple directions involving basic positions in space.

A LEVEL C student can:

1. identify common occupations, clothing, farm animals, a[...]
2. express himself using the present progressive tense (he is working) of common verbs.
3. use conjunctions and negatives correctly.
4. follow the teacher's directions related to identifying positions on a page.
5. repeat simple sentences correctly.
6. comprehend and remember major facts of a simple story.

A LEVEL D student can:

1. identify common modes of transportation and household items.
2. name the days of the week.
3. describe common weather conditions.
4. use irregular plurals and possessive pronouns correctly.
5. ask simple future tense questions.
6. understand and express comparative and quantitative concepts.
7. understand and name opposites of key words.
8. follow directions of teacher involving movement in space.
9. repeat complex sentences correctly.
10. understand and identify moods in a simple story.
11. express himself using the present and future tense.

A LEVEL E student can:

1. identify mon[...], marine animals, and common tools.
2. use superlatives and irregular past tense correctly.
3. understand an[...]
4. ask past tense questions.
5. discriminate differences in closely paired words.
6. describe and organize the main properties of common objects.

A LEVEL F student can:

1. identify the seasons and unusual occupations and animals.
2. use conditional and past perfect tenses of verbs.
3. discriminate fine differences in closely paired words.
4. express himself using past tense correctly.
5. comprehend and predict the outcome of a story.
6. recall and retell the main facts of a story.
Table 2

IPT Test Items by Skill Area and Developmental Level

VOCABULARY

LEVEL B*: 3. People; 4. School; 5. Body Parts; 6. Animals: Pets; 7. Food: Fruit and Vegetables
LEVEL C: 13. People: Occupations; 14. Clothing and Accessories; Animals: Farm; Food: Common
LEVEL D: 30. Transportation; 31. Household Items; 32. Adjectives: Weather; 36. Calendar: Days of Week
LEVEL E: [...]; 50. Animals: Marine; 51. Miscellaneous Items; 52. Tools
LEVEL F: 66. People: Occupations; Calendar: Holidays, Festivals, Seasons; Animals

COMPREHENSION

LEVEL B*: 8. Spatial Concepts: Space Prepositions; 9. Commands
LEVEL C: 21, 22. Spatial Concepts: Directionality on Page; Commands; 27-29. Story: Factual Recall
LEVEL D: 34. Comparative Concepts; 37-38. Opposite Concepts; 39. Spatial Concepts: Directionality in Space; 43-44. Story: Determining Mood
LEVEL E: 55-56. Opposite Concepts; 57. Time Concepts
LEVEL F: 78-79. Story: Predicting Outcome

SYNTAX

LEVEL B*: 10-12. Yes/No Response; Regular Plurals; Verb "to be"
LEVEL C: 17. Pronouns: Subject; 18. Verbs: Present Progressive; 19. Mass Nouns; 20. Conjunctions; 23. Negative Statements
LEVEL D: 33. Verbs: Habitual Present; 35. Comparatives; 40. Questions: Future Tense; 45. Irregular Plurals; 46. Pronouns: Possessive
LEVEL E: 53. Superlatives; 54. Verbs: Irregular Past Tense; 58. Questions: Past Tense
LEVEL F: 69. Verbs: Conditional; 70. Verbs: Past Perfect

VERBAL EXPRESSION**

LEVEL B*: Survival Responses
LEVEL C: 24-25. Sentence Repetition
LEVEL D: 41-42. Sentence Repetition; Use of Future Tense
LEVEL E: 59-65. Auditory Discrimination; Descriptive Skills
LEVEL F: 71-77. Auditory Discrimination; Use of Past Tense; 80-83. Story: Retelling Main Ideas

*LEVEL A placement is indicated [...]
**Includes Articulation Skills

© 1979 Ballard & Tighe, Inc.

Dalton, Enrique F. The IPT Oral Language Test Technical Manual. Whittier, California: Ballard & Tighe, Inc., 1979.



The juxtaposition of tasks through language skill
areas at each level is also inconsistent. At Level D
proficiency, students are asked
to repeat the days of the week to test vocabulary, to
repeat sentences as a measure of verbal expression and
to determine mood from an orally read paragraph to test
comprehension. While the first two tasks may be of
equal difficulty, determining the mood of a passage is a
more difficult task and one which is of a different
magnitude. In addition, in order to perform such a task,
a student must understand and be able to make
inferences from language. Such skills are generally
acquired through instruction and are not related to
one's ability to communicate in a language. Since some
native speakers of English who are uneducated would
have difficulty in performing such a task, it should not
be expected of L2 learners. It is better suited for a
reading achievement test, rather than a test of language
proficiency.

There are also problems with the language which is
used in the IPT test items. Many times, the directions or
cues which accompany a test item are too difficult for
the level of language proficiency which is being tested.
For example, on Level C of form A, as the teacher points
to a picture of cake and ice cream, she says, "See the
cake and ice cream? These go together at a party. On
my birthday I have cake and ice cream." Then the
teacher points to a picture of a table and chair and says,
"These go together at school. Tell me what you have at
school. I ..." The language in many of the test items
needs to be more direct, briefer and made more
appropriate to the language-proficiency level which is
being tested. A child at Level C proficiency could
conceivably be able to correctly respond with "I have a
table and a chair" but be unable to get past the
language which was designed to elicit it.

Scoring

Responses to test items are scored for appropriateness
and completeness. This means that with the exception
of items which require the identification of lexical items,
questions must be answered with full sentences in
order to receive credit. In some cases, if the child fails to
respond to an item, a paraphrase of the item which
appears on the examiner's score sheet should be given.
If the child fails to respond to the initial item and its
paraphrase, the item is scored as incorrect and the
examiner is to proceed to the next item.
Testing continues through successive levels until the
child fails to answer 50% of the items at a given
level correctly. When this occurs, testing is
discontinued, and the child is diagnosed as having
reached his/her instructional level for the IDEA Oral
Language Management Program which provides
appropriate English language activities which
correspond to the child's level of English mastery. For
example, if a child has progressed through Levels B and
C of the IPT by answering 80% of the items correctly,
but fails to answer 50% of the items on Level D
correctly, testing is stopped. She/he would be classified
as having Level C mastery of English and would begin
receiving instruction at Level D, his/her instructional
level in the language management program.

There are no items for Levels A or M. The former
classification indicates that a student failed to respond
correctly to 50% of the items on Level B. The Level M
classification indicates that 80% of the items on the test
(Levels B-F) were answered correctly. Students who
place at Level M, the mastery level, are said to have
achieved mastery of the test and are therefore not in
need of instruction in the oral language management
program.
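
The stopping rule described above can be sketched in a few lines of Python. This is only an illustration, not the publisher's scoring procedure: the function name `place_student`, the dictionary input and the single `pass_mark` parameter are assumptions made here. The review cites both an 80% figure (for progressing through a level) and a 50% figure (for discontinuing testing) and does not say what happens to scores that fall between them, so the sketch simply treats any score below `pass_mark` as the stopping point.

```python
# Levels that contain test items, in order. Levels A and M have no items.
LEVELS = ["B", "C", "D", "E", "F"]


def place_student(fraction_correct, pass_mark=0.80):
    """Return (mastery_level, instructional_level) for one child.

    fraction_correct maps a level letter to the share of that level's
    items answered correctly (0.0 to 1.0). Levels that were never
    administered are simply absent from the mapping.
    pass_mark is a hypothetical single cut-off (see the note above).
    """
    mastered = "A"  # nothing mastered yet: placement below Level B
    for level in LEVELS:
        score = fraction_correct.get(level)
        if score is None or score < pass_mark:
            # Testing stops; this level becomes the instructional level.
            return mastered, level
        mastered = level
    # Every level from B through F was passed: mastery of the test.
    return "M", None


# The example from the text: Levels B and C passed, Level D failed.
print(place_student({"B": 0.90, "C": 0.85, "D": 0.40}))
```

Calling the function with the scores from the worked example in the text yields Level C mastery and Level D as the instructional level; a child who fails Level B is placed at Level A, and a child who passes every level is placed at Level M with no instructional level.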



Score Conversion

A conversion table is provided to derive the
corresponding NEP/LEP/FEP classifications from the
IPT scores to aid teachers in making decisions about
placement or exit from the bilingual program. The IPT
score and its NEP/LEP/FEP equivalent are on a sliding
scale which corresponds to the students' grade level
(see Figure 1).



Figure 1

Correlation Between IPT Test Levels and NEP/LEP/FEP Designations*

(Chart of NEP, LEP and FEP designations by grade level and IPT test level.)

When levels overlap, use teacher judgment for designations.
NEP: Non-English Proficiency
LEP: Limited English Proficiency
FEP: Fluent English Proficiency
**Mastery of Test

*Dalton, Enrique F. The IPT Technical Manual. Whittier, California: Ballard and Tighe, Inc., 1979, p. 23.



Kindergarten students who score at Level C or D on the
IPT are classified as fully English proficient, while first
graders must score at Level D or E to be classified as
FEP. Students in grades 2-8 must score at Level E or F to
be classified as fully English proficient. Examination of
Figure 1 shows that with the exception of Level A and
Level F for children in grades 2-8, all of the language
classifications overlap with another category. In these
cases, teachers are advised to use teacher judgment to
determine in which of the two categories a child should
be placed.
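
The grade-dependent FEP ranges stated in the preceding paragraph can be written as a small lookup, which makes the sliding scale easier to see. The sketch below is illustrative only: the function name `fep_levels` and the use of grade 0 for kindergarten are assumptions made here, and it encodes only the FEP ranges given in the text, not the NEP and LEP bands of Figure 1 or the overlaps that are left to teacher judgment.

```python
def fep_levels(grade):
    """Return the IPT levels described as FEP for a given grade.

    grade 0 stands for kindergarten (an assumption of this sketch).
    Only the ranges stated in the review text are encoded; other
    levels and the overlapping categories are not addressed here.
    """
    if grade == 0:
        return {"C", "D"}
    if grade == 1:
        return {"D", "E"}
    if 2 <= grade <= 8:
        return {"E", "F"}
    raise ValueError("the IPT is described for kindergarten through grade 8")


# The same IPT level can carry a different designation in different grades.
for grade in (0, 1, 5):
    print(grade, sorted(fep_levels(grade)))
```

Level D, for instance, falls in the FEP range for kindergarten and first grade but not for grades 2-8, which is the grade dependence the review goes on to criticize.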

The IPT Technical Manual states that these correlations
were determined by a research study, but it is not
described in the manual. There are several problems
with this scoring system. First of all, correlating L2
learners' language proficiency with grade in school is
unacceptable, as year in school does not indicate the
same amount of exposure to the language in the case of
L2 learners. Secondly, due to the overlapping of
categories within grade levels and reliance upon
"teacher judgment" to classify students, one has to
question the instrument's reliability. Finally, at all grade
levels students are classified as fully English proficient,
but are not eligible for exit from the bilingual program
until they reach the next highest level of proficiency. In
summary, the IPT scoring system, which assigns
children to one of seven developmental levels of
proficiency corresponding to grade level and then
converts that score to an NEP/LEP/FEP classification
upon which to make judgments about bilingual
program placement, does not follow common
procedures, and the rationale for this departure is not
Field Testing

The IPT was normed on 2,061 identified NEP/LEP
students in grades K-8 who were enrolled in bilingual
classes in rural, urban and suburban schools in
California. The authors do not state how the students
were originally identified as NEP/LEP, nor do they
provide information regarding their ethnicity or the
variety of Spanish which was spoken by them. The
examiners who conducted the field testing were all
bilingual/bicultural certified teachers who received a
ten-hour training session on the proper administration
of the IPT. While the IPT Technical Manual reports high
reliability and validity figures, close examination of the
test's theoretical base, its explanation of what
constitutes language performance and its failure to
recognize some fundamental differences in L1 and L2
learners must call the use of this test as a language
proficiency measure for L2 learners into question.

Perhaps the most glaring flaw in the IPT is its failure to
distinguish between the L1 and L2 language acquisition
process and characteristics of L1 and L2 learners. In the
technical manual, Dalton describes the following stages
of language development in language acquisition:
babbling, echolalic, telegraphic and syntactic. He states,

"The child finds that language is a valuable tool
and uses it more extensively in daily social
situations. Frequent opportunities to use the
language tend to nurture and expand its
growth in the child. This is to be encouraged
in that oral language skills, both receptive and
expressive, are the foundation of initial reading
and, eventually, literacy" (Dalton, 1980, p. 2).



It is not clear what applicability stages of
language development in language acquisition
have to L2 learners of [...] to [...] native speakers
of English who need [...]
prior to reading [...]
possessing nearly full [...]
native language. Crucial questions which were ignored
prior to the development of this test, which purports to
measure the language proficiency of both L1 and L2
learners of English, are: a) Does a child learning English
as a second language progress through the same stages
in the language acquisition process as an L1 learner? b)
If the L2 learner does not pass through the same
developmental stages as an L1 learner, what are the
characteristics of the L2 acquisition process? and c)
Which elements of the lexicon, syntax, phonology and
functional language are acquired at each stage by both
the L1 and the L2 learner?


There is a considerable body of research related to the
similarities between the L1 and L2 language acquisition
process. Brown's study (1973) was an observational,
longitudinal study of English acquisition of three
children. The study found that children learning English
progress through a series of stages where they acquire
a set of fourteen grammatical morphemes in roughly
the same order. These results were further validated by
a cross-sectional study on the order of morpheme
acquisition in twenty-one preschool-age children who
were learning English as a first language (de Villiers
and de Villiers, 1973). The morpheme sequence which
was obtained in the de Villiers' study was not identical,
but corresponded closely to Brown's results.

Brown's work served as the impetus for many
subsequent studies of morpheme sequencing in the
area of second language acquisition. Dulay and Burt
conducted a series of studies on children of different
language backgrounds who were learning English as a
second language (Dulay and Burt, 1973, 1974a, 1974b).
In these studies, the Bilingual Syntax Measure was used
to elicit speech from children ages 5-8 who were
learning English as a second language. Analysis of the
children's responses for the presence or absence of
grammatical morphemes showed that L2 learners of
English, regardless of their language background,
demonstrated the same order of morpheme acquisition
in English. From this, Dulay and Burt conclude that "the
strategies of second-language acquisition are universal"
(1973, p. 256). It should be noted that the morpheme
sequence which was found by Dulay and Burt was not
identical to Brown's or to that of de Villiers and de
Villiers. Dulay and Burt feel that the differences which
appeared in their data are due to the greater linguistic
and cognitive maturity of the L2 learner.

Other studies which examined the issue of the order of
morpheme acquisition in L2 learners of English and
produced similar results were Fathman (1975) and
Krashen, Sferlazza, Feldman & Fathman (1976).
Both of these studies used a different speech
elicitation test, the SLOPE. The former found that both
Spanish and Korean children exhibited a similar
morpheme acquisition order while learning English. The
latter replicated the study with adult L2 learners and
obtained a similar acquisition sequence. These studies
suggest that the order of morpheme acquisition in
English by L2 learners is invariant regardless of one's
native language or age. While this group of studies has
been criticized on several grounds by Larsen-Freeman
(1975), Rosansky (1976) and Hakuta (1976), the
similarities among the data are impressive and bear
further research.

In the presentation of the rationale for the IPT, Dalton
mentions the original Brown study on L1 learners of
English only in passing. He fails to explain how the
study relates to the construction of the IPT or the items
which were chosen to measure syntax at each of its
levels which contain items, B-F. The issue of possible
differences or similarities in the L1 and L2 acquisition
process is never raised. Furthermore, the ordering of the
items which measure one's control of syntax according
to proficiency levels is illogical and does not follow a
progression from simple structures at low proficiency
levels to complex at higher levels. Examination of the
IPT shows that the material tested at each proficiency
level corresponds to the material presented in the IDEA
Oral Language Management Program levels, rather than
to levels of difficulty in language structure or to
morpheme acquisition sequences in English by L1 and
L2 learners.
Another indication that the IPT ignores characteristics
of L2 learners is the fact that language proficiency
levels are correlated to grade levels. While it is possible
to correlate a child's L1 language development with
his/her grade in school, such a correlation in the case of
the L2 learner does not reflect the same amount of
exposure to the language. A child who is a new arrival
and who is beginning to learn English in the 6th grade
cannot be expected to have the same language skills as
a native, English-speaking child who is in the same
grade. Furthermore, such a system fails to accurately
distinguish among L2 learners in the same grade who
may have differing amounts of exposure to English due
to their date of arrival in the United States in the case of
older immigrant children, or due to sporadic school
attendance in the case of migrant children.

Construct Validity 

The IPT has even more serious problems of a theoretical
nature in that what it purports to
measure is not clearly defined. While the IPT is labeled
as an oral language proficiency test, the word mastery
is used interchangeably with proficiency throughout.
The point is made that linguistic competence must be
inferred from linguistic performance data and that the
constructs which underlie linguistic performance and
upon which the IPT is based are the following: a)
language is developmental; b) it is incremental; c) it is
systematic; d) it is used to communicate in a social
context; and e) it involves both receptive and
productive skills (Dalton, 1979, p. A-3). It is implied that
some of the constructs which underlie linguistic
performance, as described by the author, are inherent in
the IPT due to its format, which places the child in a
dyadic social situation with the examiner where the
child must comprehend and produce progressively
more difficult language. From the child's responses,
his/her linguistic performance is evaluated and the level
of language proficiency is determined. Dalton states,
"Validation of the above constructs can be inferred if
the data generated through the (construct validity)
study demonstrate positive correlation with the age and
grade of the student and the IPT level results" (Dalton,
1979, p. A-4).

The construct validity study was conducted on 364
students in a suburban school district in California. An
analysis was made of the correlation between age and
IPT results on forms A and B of the English version of the
test. The data were further analyzed after excluding the
scores of NEP/LEP students, which resulted in a sample
which was comprised of native speakers of English or
L2 learners who were fluent speakers of English. The
same procedures were followed to derive a correlation
between IPT results and the students' grade level. Not
surprisingly, the study obtained high positive
correlations between IPT results and age and grade
level with high levels of significance on both forms of
the test.

Having shown that a strong positive correlation existed
between the age and grade of native speakers of
English and language proficiency in their own language,
Dalton concludes that these data validate the
constructs which state that language is developmental,
incremental and systematic. In addition, the data are
also said to validate the use of the IPT as a method of
assessing oral language proficiency through linguistic
performance.

The fact that there is a strong positive correlation
between the age and grade in school of native speakers
of English hardly seems to be a worthy subject for
research. Furthermore, one has to wonder why the
construct validity study was conducted and what its
applicability as a rationale for a language-proficiency
measure for L2 learners is. An even more basic problem
with the study is its definition of linguistic performance,
which is composed of vague generalities which are not
accurate. While language may be said to be systematic
and incremental in its development, one's linguistic
performance is not. Linguistic performance is a
speaker's use of his/her linguistic competence
(underlying knowledge of the system of a language) at
any one time. In a native speaker of a language,
linguistic performance may vary from one occasion to
the next due to such factors as memory, distractions,
shifts of attention, etc. (Maclay & Osgood, 1959, p. 24).

Language proficiency measures attempt to accurately
measure one's linguistic competence by reference to
linguistic performance data which is variable. Due to
this fact, a more germane construct upon which to
measure a test for L2 learners is, what is the nature of
communicative competence? What knowledge must
one possess in order to produce grammatically correct
speech in situationally appropriate contexts? Such an
approach requires the articulation and validation of the
learner's stages of language acquisition in the areas of
syntax, morphology, lexicon, phonology, functional
language and sociolinguistic skills. A complete theory
of communicative competence would also have to
order and weight items to indicate which are the most
crucial for achieving intelligibility in a language. While
such a theory has not been fully developed to date,
researchers in the area of language testing are working
toward that end. Federal and state regulations which
require language proficiency testing for children in
bilingual programs have caused great strides to occur in
the field in recent years. Unfortunately, the IPT does not
reflect suggested practices in the field.

Content Validity

Content validity is the extent to which a test covers a
representative sample of behavior in the domain to be
measured. The IPT purports to measure language
mastery in the areas of syntax, lexicon, phonology,
morphology, comprehension and oral expression and
reports extremely high percentages of items which
correspond to and measure each one. This stems from
the fact that test items were scored for all of the above
categories which could possibly pertain to a response
and not to those which are reflected in the IPT scoring
system. For example, the item "Who is this?", which
requires the student to supply the lexical item teacher,
is said to measure lexicon, phonology (A/), morphology
(/or/), comprehension and oral expression. In reality, the
item only measures the production of a lexical item, as
the IPT scoring system does not provide for the scoring
of pronunciation, comprehension or oral expression.
The technical manual reports that 100% of the items
measure comprehension, but this is not so. In the above
example, the student could have understood the
question perfectly well, but might not have known the
word teacher and supplied another word. From such a
situation, one would be mistaken in concluding that the
child did not comprehend the language in the question.

Within proficiency levels, the IPT has a very restricted
range of items which measure each area of language.
Form A of the English version of the test has an average
of 16 items in Levels B-F. An analysis of the test items
showed that over half of the items at each level (with
the exception of Level F) call for the student to produce
isolated vocabulary words in response to questions
about a picture (Level B = 58%; Level C = 31%;
Level [illegible] = 33%; Level F = 18%). Of the twelve
items on Level [illegible], only three measure the child's
knowledge of syntax and each tests a different structure.
Due to the manner in which test items were classified by
the areas of language they purport to measure and also
the restricted range of items tested at each level, the IPT's
content validity is suspect.

Criterion Validity 

The IPT test results were checked against three criteria
to determine their criterion validity. Cross-tabulations of
students' scores on the English version of the IPT, forms
A and B, were made with a) teachers' predictions of
students' IPT scores, b) students' previous scores on
state-adopted language proficiency tests and c)
teachers' predictions of students' NEP/LEP/FEP
classifications. The technical manual reports high
validity figures and levels of significance for all three
analyses and concludes that the IPT has criterion and
predictive validity as a measure of language
proficiency.1

The most interesting result was the correlation of
students' scores with their NEP/LEP/FEP
classification as determined by one of four California
state-approved oral language proficiency tests: the
LAB, the LAS, the BINL, and the BSM.2 The statistical
analysis correlated the students' scores on the IPT,
which could be one of the seven possible levels A-F and
M, with their NEP/LEP/FEP scores as determined by one
of the state-approved tests. High positive correlations
between IPT scores and the classifications reported by
the state-approved tests were obtained (Form A,
Pearson's R = .75 at the .00001 level of significance, n =
721; Form B, Pearson's R = .59 at the .00001
significance level, n = 492).
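As a rough illustration of the kind of computation behind such figures, the sketch below calculates Pearson's R between IPT levels and NEP/LEP/FEP classifications. The numeric codings of the levels and classifications, the function name pearson_r and the seven sample records are all assumptions made for this example; the technical manual's actual coding and data are not reproduced here.

```python
from math import sqrt

# Assumed numeric codings (hypothetical): the source does not say how the
# seven IPT levels or the three classifications were coded for analysis.
IPT_LEVELS = {"A": 1, "B": 2, "C": 3, "D": 4, "E": 5, "F": 6, "M": 7}
CLASSES = {"NEP": 1, "LEP": 2, "FEP": 3}


def pearson_r(xs, ys):
    """Return the Pearson product-moment correlation of two equal-length lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Invented records for illustration only: (IPT level, classification by
# some other test). These are not data from the study.
records = [
    ("A", "NEP"), ("B", "NEP"), ("C", "LEP"), ("D", "LEP"),
    ("E", "FEP"), ("F", "FEP"), ("M", "FEP"),
]
ipt = [IPT_LEVELS[level] for level, _ in records]
cls = [CLASSES[label] for _, label in records]
r = pearson_r(ipt, cls)
print(round(r, 3))
```

Note that a high R of this kind only says that the two codings rise together; it does not show that the criterion classifications are themselves trustworthy.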

These data are interesting when viewed in light of a
previous study conducted by Ulibarri, Spencer and
Rivas (1981) which examined the comparability of
language proficiency classifications produced by the
LAS, the BSM and the BINL. The test scores of over
1,100 students in first, third and fifth grades were
compared. The study showed that the proportions of
students who were classified as NEP/LEP/FEP by each
test were highly disparate. They concluded that since
the three tests did not assign approximately the same
percentages of students to the three language
proficiency classifications, they were not comparable.
They also warned that if different tests were used for
reclassification of students, it would be possible for a
student to enter and exit a bilingual program on the
same day (p. 79).
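The comparability check described above can be sketched as follows: compute, for each test, the proportion of students placed in each classification, then see how far those proportions differ and which students receive conflicting labels. The test names "Test 1" through "Test 3" and the five classifications per test are invented for illustration; they are not the LAS, BSM or BINL results.

```python
from collections import Counter

CATEGORIES = ("NEP", "LEP", "FEP")


def proportions(labels):
    """Return the share of students assigned to each classification."""
    counts = Counter(labels)
    total = len(labels)
    return {c: counts[c] / total for c in CATEGORIES}


def max_spread(per_test):
    """For each classification, return the largest gap between any two tests."""
    spread = {}
    for c in CATEGORIES:
        values = [p[c] for p in per_test.values()]
        spread[c] = max(values) - min(values)
    return spread


# Hypothetical classifications of the same five students by three tests.
scores = {
    "Test 1": ["NEP", "LEP", "LEP", "FEP", "FEP"],
    "Test 2": ["NEP", "NEP", "LEP", "LEP", "FEP"],
    "Test 3": ["LEP", "LEP", "FEP", "FEP", "FEP"],
}

per_test = {name: proportions(labels) for name, labels in scores.items()}
spread = max_spread(per_test)

# Students (by position) whom the tests do not classify identically.
n_students = len(next(iter(scores.values())))
disagreements = [
    i for i in range(n_students)
    if len({labels[i] for labels in scores.values()}) > 1
]

print(per_test)
print(spread)
print(disagreements)
```

A student who appears in the disagreement list with LEP on one test and FEP on another is exactly the case the authors warn about: eligible to enter the program under one instrument and to exit it under the other.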

Given the fact that three of the language proficiency
tests used in the IPT criterion validity study were those
examined in the Ulibarri et al. study and found to be
incomparable in assigning students to the same
language proficiency classifications, the criterion
validity figures which were stated for the IPT seem
implausible. If three of the tests which were used as
criteria against which to measure the predictive validity
of the IPT do not show agreement among themselves in
assigning students to language-proficiency levels, how
could they show a high positive correlation with the IPT
when a fourth test, the LAB, was added?

While the IPT reports high construct, content and
criterion validity, close examination of the studies casts
shadows on its validity as an accurate measure of oral
language proficiency for L2 learners. For that reason,
the question of its reliability will not be addressed.



Conclusion 



There has been a renewed interest in the subject of
language-proficiency testing in recent years, largely due
to mandated language-proficiency testing of students
participating in bilingual education programs. The
original impetus for mandated testing resulted from the
Aspira Consent Decree of 1973. The majority of state
and federally funded bilingual programs are transitional
in scope and are aimed at bringing the
limited-English-proficient (LEP) child to the point where
she/he can exit the bilingual program and begin
instruction in the all-English classroom. Although
language-proficiency testing in bilingual programs was
mandated in order to identify both students who were
eligible for program participation and those who had
acquired sufficient English-language proficiency to
warrant their exit from the program, virtually none
existed at the time.

1 Correlation of Teachers' Predictions of Students' IPT Scores and IPT Results: Form A, Pearson's R = .79 at the .00001 significance level; Form B, Pearson's R = .66 at the .00001 significance level. Students' Previous Scores on State-Adopted Language Proficiency Tests and IPT Results: Form A, Pearson's R = .75 at the .00001 significance level; Form B, Pearson's R = .59 at the .00001 significance level. Teachers' Predictions of Students' NEP/LEP/FEP Classifications with IPT Results: Form A, Pearson's R = .71 at the .00001 significance level; Form B, Pearson's R = .63 at the .00001 significance level.

2 The abbreviations refer to the following commonly used proficiency tests: LAB = Language Assessment Battery; LAS = Language Assessment Scales; BINL = Basic Inventory of Natural Language; BSM = Bilingual Syntax Measure.



Previous language-proficiency testing was mostly done
within the area of foreign language pedagogy. With the
exception of the Foreign Service Interview (FSI), which
was used by the State Department, the
language-proficiency tests which were used in foreign
language classrooms were paper-and-pencil tests which
emphasized receptive, rather than productive, language
skills. Furthermore, due to the influence of structural
linguistics, they were discrete-point in nature and
tested only one specific language skill in one
component of language at a time, e.g., the ability to
comprehend elements of the phonological system, etc.

These types of measures were unsatisfactory for they
did not always positively correlate with one's ability to
communicate in a language and did not satisfy the
requirement of the Aspira Consent Decree which
stipulated that "the placement of children in
educational programs using English or Spanish as the
medium of instruction be determined by their ability to
'effectively' participate in instruction" (Shuy, 1977, p.
79). When viewed in the light of prevailing linguistic
theory, their definition of language proficiency as
control of the syntax, lexicon and phonology of a
language is too narrow and ignores a native speaker's
knowledge regarding functional language and the rules
which govern language use.

Shuy (1977) defines functional language competence
as "[illegible] to use
their language to make utterances to others in terms of
their goals. It includes a knowledge of what kinds of
goals language can accomplish (the functions of
language) and of what are permissible utterances to
accomplish each function" (language strategies) (p. 81).
Examples of language functions would be giving an
order, making promises or apologies, etc. Operating
from this position, more integrative or global language
proficiency measures are called for which more closely
approximate the process which speakers of a language
go through when communicating in a language.



While proficiency measures today do stress productive
skills, they are still for the most part discrete-point or at
best quasi-integrative (tests which employ
discrete-point scoring of speech samples). They still
focus mainly on measuring those elements of language
which are the most testable and most easily quantified:
phonology and lexicon. Their focus is narrowed
further in that the language used in the tests is
generally restricted to the school domain.

Language-proficiency testing is poorly understood and
approached with dread by teachers and administrators
who feel that language-proficiency test results provide
no answers for them at the instructional level. They see
it as a time-consuming activity which impinges upon
instructional time and yields no useful information
regarding how one should instruct the child
subsequently. They want to know, given the
NEP/LEP/FEP classification of a student, what kind of
instructional program should be provided for the child
to help him/her achieve English proficiency.

The IPT Oral Language Proficiency Test is a response to
this very real concern of teachers and administrators in
that it provides an oral language management program
of developmental activities purporting to enhance a
student's language proficiency based on its results. It
also claims to prepare L2 learners, as well as L1 learners
who have language which is not fully developed, for
reading instruction. Close examination of the IPT has
shown that it has such theoretical and technical
problems that one must question whether it actually
measures language proficiency or simply a child's
progression through the IDEA Oral Language
Management Program. As it is doubtful that the
theoretical construct (language proficiency) the IPT
purports to measure really is being measured, both its
claims and the advisability of its use for L2 learners are
questioned.

Introduction

Over the past quarter century various classification
schemes, taxonomies or rubrics have been proposed for
use in analyzing the adequacy of educational tests. The
Taxonomy of Educational Objectives: Cognitive
Domain, edited by Bloom (1956), had its origins
primarily in general, college-level achievement testing.
Neither this classification scheme nor the subsequent
one by Krathwohl, Bloom and Masia (1964) focused
specifically on language assessment. However, the
applications in this area, particularly of the Cognitive
Domain Handbook, are many. Of more direct relevance,
though, is a rubric suggested by Stansfield (1982). He
has proposed a three-dimensional design which
incorporates linguistic components (phonology, syntax,
lexicon), the traditional communication skills (listening,
speaking, reading, writing) and a sociolinguistic domain
(home, neighborhood, school). This 3 x 3 x 4 matrix of
thirty-six cells deals with such a wide array of language
assessment possibilities that it would be beyond
practicality to tap all of these aspects with one test
instrument. "Nonetheless, the matrix is useful in judging
the validity of an instrument. Thus, as a general rule, the
more cells we find included in a test, the greater its
validity" (Stansfield, 1981, p. 236).
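The matrix just described can be made concrete with a short sketch that enumerates the thirty-six cells and reports what fraction of them a given test touches. The dimension values come from the paragraph above; the coverage function and the example test are hypothetical, and treating the fraction of cells covered as a validity indicator is only the "general rule" quoted, not a formal measure.

```python
from itertools import product

# The three dimensions as listed in the text.
COMPONENTS = ("phonology", "syntax", "lexicon")
SKILLS = ("listening", "speaking", "reading", "writing")
DOMAINS = ("home", "neighborhood", "school")

# Every (component, skill, domain) combination is one cell of the matrix.
ALL_CELLS = set(product(COMPONENTS, SKILLS, DOMAINS))


def coverage(tested_cells):
    """Return the fraction of matrix cells that a test includes."""
    tested = set(tested_cells)
    unknown = tested - ALL_CELLS
    if unknown:
        raise ValueError(f"cells outside the matrix: {sorted(unknown)}")
    return len(tested) / len(ALL_CELLS)


# Hypothetical test: syntax and lexicon, listening and speaking, school only.
example = set(product(("syntax", "lexicon"), ("listening", "speaking"), ("school",)))

print(len(ALL_CELLS))
print(len(example), round(coverage(example), 3))
```

Even a test that samples two components and two skills touches only a small share of the matrix once it is confined to a single domain, which is why covering every cell with one instrument is judged impractical.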

With validity as a major focal point, then, it would be
well to cite the Standards for Educational and
Psychological Tests (1974) as another approach to
analyzing test adequacy. Under the heading of Content
Validity, standard E12 states: "If test performance is to
be interpreted as a representative sample of
performance in a universe of situations, the test manual
should give a clear definition of the universe
represented ..." It comments further: "The definition
should be operational, rather than theoretical,
containing specifications regarding classes of stimuli,
tasks to be performed and observations to be scored" (p.
45). An exploration of how some test makers have
defined specifications regarding classes of stimuli and
tasks to be performed has led this writer to the
conclusion that another supplementary rubric is needed
to judge the adequacy of language assessment
procedures.

Basis for Another Rubric 



Before proceeding further, a few definitions may be in
order. A remark by Foster (1974) can serve as a basis for
clarifying what is meant by correct Standard English or
correct Standard Polish or correct Standard (any
language you can name). "It is unfortunate that school
personnel too often are rigidly bound and limited by
their concept of correct Standard English. In the reality
of the life outside the school, however, the proper
language is the language that brings people together
and succeeds in accomplishing an objective" (p. [illegible]).

Another term requiring clarification is assessment. A
definition which this writer has found useful runs as
follows: assessment is the systematic recording and
reporting of a behavior change which is considered
desirable in relation to a previously stated objective.

An obvious common thread running through both
definitions is the necessity for having an objective. But
how can one get a handle on the vast number of
educational objectives that would constitute a "well
rounded" curriculum? The National College Verb List
shown in Figure 1 has been of some help in this regard.
An earlier edition of this list was published by Gronlund
(1970). The list as shown here appears in Pucel and
Knaak (1975).

The 455 verbs have been found to be useful in
generating statements of educational objectives and for
developing assessment procedures to measure whether
or not a learner has accomplished them. Clearly, it
would not be possible to assess a person in all these
behaviors at one time. However, as shown in Figure 1,
the verbs and associated behaviors can be grouped in a
number of ways. Further examination of the list reveals
that additional classifications are possible.

One of these other ways of classification is related to
the traditional dichotomy of testing techniques:
Recognition test items vs. Recall (Mitchell, no date). A
more functional description of these two could be,
respectively: Selected Response items of the
multiple-choice type, and Supplied Response items
which involve single oral or written answers, as well as
extended discourse both in writing and speaking along
with all other remaining types of responding. Another
dichotomous grouping which follows from this reflects
2 Dimensional, silent, pencil-and-paper behavior vs. 3
Dimensional behavior involving verbal production
or interactions between people and between people
and things. Thus, the 2 x 2 scheme or rubric shown in
Figure 2 can be generated.

When we attempt to apply this 2 x 2 rubric in the
specific area of language assessment, however,
something seems to be missing. The missing facet
emerges when referring to the admonition in the
Standards for Educational and Psychological Tests
about "specifications regarding classes of stimuli."
What has been left out is the medium or channel
through which stimuli are presented to an examinee.
This leads to a third dichotomy: oral vs. printed
stimuli. Figure 3 shows what the rubric contains when
all three dichotomies are combined. Altogether, eight
cells result.
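The way three two-way distinctions yield eight cells can be sketched in a few lines. The Cell type, its field names and the label helper are illustrative choices made for this example, not terminology from the rubric itself.

```python
from itertools import product
from typing import NamedTuple


class Cell(NamedTuple):
    """One cell of the rubric: a choice on each of the three dichotomies."""
    stimulus: str   # "printed" or "oral"
    behavior: str   # "2D" (pencil-and-paper) or "3D" (verbal or interactive)
    response: str   # "selected" or "supplied"


STIMULI = ("printed", "oral")
BEHAVIORS = ("2D", "3D")
RESPONSES = ("selected", "supplied")

# 2 x 2 x 2 combinations give the eight cells.
CUBE = [Cell(s, b, r) for s, b, r in product(STIMULI, BEHAVIORS, RESPONSES)]


def label(cell):
    """Format a cell in the word order the text uses for cell names."""
    return (f"{cell.behavior}, {cell.response.capitalize()} Response, "
            f"{cell.stimulus.capitalize()} Stimuli")


for cell in CUBE:
    print(label(cell))
```

Each test objective can then be assigned to exactly one cell by answering three questions: how the stimulus is presented, whether the behavior is pencil-and-paper or not, and whether the response is chosen or produced.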



Figure 1
Index Verborum Permittorum

The Functional, Forceful Four Hundred Fifty-Five

1. "Creative" Behaviors
Alter, Ask, Change, Design, Generalize, Modify, Paraphrase, Predict, Question, Rearrange, Recombine, Reconstruct, Regroup, Rename, Reorder, Reorganize, Rephrase, Restate, Restructure, Retell, Revise, Rewrite, Simplify, Synthesize, Systematize, Vary

2. Complex, Logical, Judgmental Behaviors
Analyze, Appraise, Assess, Combine, Compare, Conclude, Contrast, Criticize, Deduce, Defend, Designate, Determine, Discover, Evaluate, Formulate, Generate, Induce, Infer, Plan, Structure, Suggest, Substitute

3. General Discriminative Behaviors
Choose, Collect, Define, Describe, Detect, Differentiate, Discriminate, Distinguish, Identify, Indicate, Isolate, List, Match, Omit, Order, Pick, Place, Point, Select, Separate

4. Social Behaviors
Accept, Admit, Agree, Aid, Allow, Answer, Argue, Communicate, Compliment, Contribute, Cooperate, Dance, Disagree, Discuss, Excuse, Forgive, Greet, Help, Interact, Invite, Join, Laugh, Meet, Participate, Permit, Praise, React, Reply, Smile, Talk, Thank, Visit, Volunteer

5. Language Behaviors
Abbreviate, Accent, Alphabetize, Articulate, Call, Capitalize, Edit, Hyphenate, Indent, Outline, Print, Pronounce, Punctuate, Read, Recite, Say, Sign, Speak, Spell, Summarize, Syllabicate, Tell, Translate, Verbalize, Whisper, Write

6. "Study" Behaviors
Arrange, Categorize, Chart, Cite, Circle, Classify, Compile, Copy, Diagram, Find, Follow, Gather, Itemize, Label, Locate, Look, Map, Mark, Name, Note, Organize, Quote, Record, Reproduce, Search, Sort, Underline

7. Music Behaviors
Blow, Bow, Clap, Compose, Finger, Harmonize, Hum, Mute, Play, Pluck, Practice, Sing, Strum, Tap, Whistle

8. Physical Behaviors
Arch, Bat, Bend, Carry, Catch, Chase, Float, Grab, Grasp, Grip, Hit, Hop, Jump, Kick, Knock, Lift, March, Pitch, Pull, Push, Run, Skate, Ski, Skip, Somersault, Stand, Step, Stretch, Swim, Swing, Throw, Toss, Walk

9. Arts Behaviors
Assemble, Blend, Brush, Build, Carve, Color, Construct, Dot, Draw, Drill, Fold, Form, Hammer, Handle, Heat, Illustrate, Melt, Mix, Mold, Paint, Paste, Pat, Roll, Rub, Sand, Saw, Sculpt, Shake, Smooth, Stamp, Stick, Stir, Trace, Trim, Varnish, Wrap

10. Drama Behaviors
Act, Clasp, Cross, Direct, Display, Emit, Enter, Exit, Express, Imitate, Leave, Move, Pantomime, Pass, Perform, Proceed, Respond, Sit, Turn

11. Mathematical Behaviors
Add, Bisect, Calculate, Check, Circumscribe, Compute, Count, Cumulate, Derive, Divide, Estimate, Extrapolate, Extract, Graph, Group, Integrate, Interpolate, Measure, Multiply, Number, Plot, Prove, Reduce, Solve, Square, Subtract, Sum, Tabulate, Tally, Verify

12. Laboratory Science Behaviors
Align, Apply, Attach, Balance, Calibrate, Conduct, Connect, Convert, Decrease, Demonstrate, Dissect, Feed, Grow, Increase, Insert, Keep, Lengthen, Limit, Manipulate, Operate, Plant, Prepare, Remove, Replace, Report, Set, Specify, Straighten, Time, Transfer, Weigh

13. General Appearance, Health and Safety Behaviors
Button, Clean, Clear, Close, Comb, Cover, Dress, Drink, Eat, Eliminate, Empty, Fasten, Fill, Go, Lace, Stack, Stop, Taste, Tie, Unbutton, Uncover, Untie, Unzip, Wait, Wash, Wear, Zip

14. Miscellaneous
Aim, Attempt, Attend, Begin, Bring, Buy, Come, Complete, Correct, Crease, Crush, Develop, Distribute, Do, Drop, End, Erase, Expand, Extend, Feel, Finish, Fit, Fix, Flip, Get, Give, Grind, Guide, Hand, Hang, Hold, Hook, Hunt, Include, Inform, Kneel, Lay, Lead, Lend, Let, Light, Make, Mend, Miss, Offer, Open, Pack, Pay, Peel, Pin, Position, Present, Produce, Propose, Provide, Put, Raise, Relate, Repair, Repeat, Return, Ride, Save, Scratch, Send, Serve, Sew, Share, Sharpen, Shoot, Shorten, Shovel, Shut, Signify, Slip, Slide, Spread, Stake, Start, Store, Strike, Supply, Support, Switch, Take, Tear, Touch, Try, Twist, Type, Use, Vote, Watch, Weave, Work

Note that all of the verbs in this entire list could be subsumed under either Show or Tell.



Figure 2

Examples of Verbs Classified According to a 2 x 2 Scheme

2D Behavior, Selected Response:
Choose, Detect, Differentiate, Discriminate, Distinguish, Identify, Indicate, Match, Omit, Order, Pick, Select; Arrange, Categorize, Circle, Classify, Find, Locate, Mark

2D Behavior, Supplied Response:
Define, in writing ...
Describe, in writing ...
List ...
(all but two or three of the "creative" behaviors)
(all of the complex, logical, judgmental behaviors)
(almost all the mathematical behaviors)
Chart, Cite, Compile, Diagram, Follow, Itemize, Label, Map, Name, Note, Organize, Quote

3D Behavior, Selected Response:
Choose, Detect, Differentiate, Discriminate, Distinguish, Identify, Indicate, Isolate, Match, Order, Pick, Place, Point, Select, Separate; Arrange, Categorize, Classify, Find, Gather, Locate

3D Behavior, Supplied Response:
Define, orally ...
Describe, orally ...
List ...
(all but two or three of the "creative" behaviors)
(all of the complex, logical, judgmental behaviors)
(all of the social behaviors)
(all of the music behaviors)
(all of the physical behaviors)
(almost all of the arts behaviors)
(all of the drama behaviors)
(almost all the laboratory science behaviors)
(all of the general appearance, health and safety behaviors)
Follow ...
Label, orally ...
Name, orally ...
Organize
Quote

Figure 3

[Cube diagram combining the three dichotomies: Selected Response vs. Supplied Response, 2D Behavior vs. 3D Behavior, and Oral vs. Printed Stimuli]



Application of the Cubic Rubric to a Sample of
Language Tests

Figure 4 and Figure 5 illustrate how well two language
tests fill the cells in the proposed classification scheme.
A preliminary step was to phrase a statement, an
educational objective, for each identifiable part of the
test being analyzed. In Figure 4 the parts of the Illinois
Test of Psycholinguistic Abilities (Kirk, McCarthy and
Kirk, 1968) are checked against the rubric. All sections
of the Short Tests of Linguistic Skills (Frederickson,
1976) are the ones shown in Figure 5.

In Figure 4 the ITPA touches all but three cells, with
heavy representation in the 2D, Selected Response,
Printed Stimuli and the 3D, Supplied Response, Oral
Stimuli categories. A better balance and more complete
coverage of cells is evident in Figure 5. Here the STLS
missed only in two categories. When gaps appear, it
now is possible to generate a set of statements and
related test situations to increase the coverage. For
example, on the STLS in the 3D, Selected Response,
Printed Stimuli cell, an objective such as the following
might fit: Pick out, from each of six groups of four
objects, the one specified by a set of printed directions.
For the missing 3D, Supplied Response, Printed Stimuli
cell, a statement such as this might suffice: Ask
someone for help as if being in the position of a person
in three different situations described in writing.
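The gap-finding step described above can be sketched as a small set computation. The two empty STLS cells are the ones named in the preceding paragraph; the tuple layout and the helper names gaps and coverage are assumptions made for this illustration.

```python
from itertools import product

# Each cell is (stimulus, behavior, response); eight cells in all.
ALL_CELLS = set(product(("printed", "oral"), ("2D", "3D"), ("selected", "supplied")))


def gaps(filled):
    """Return the cells of the rubric that a test leaves empty, in sorted order."""
    return sorted(ALL_CELLS - set(filled))


def coverage(filled):
    """Return the fraction of the eight cells that a test fills."""
    return len(set(filled) & ALL_CELLS) / len(ALL_CELLS)


# The text names the two cells the STLS misses: both 3D cells under
# printed stimuli. Every other cell is treated here as filled.
STLS_EMPTY = {
    ("printed", "3D", "selected"),
    ("printed", "3D", "supplied"),
}
stls_filled = ALL_CELLS - STLS_EMPTY

print(coverage(stls_filled))
for cell in gaps(stls_filled):
    print("objective needed for:", cell)
```

Listing the empty cells in this way gives a test author a checklist: each gap is a prompt to write one new objective and a matching test situation, as in the two examples above.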

In addition to the two tests analyzed here in some
detail, a quick summary check was made on seventeen
other tests taken in a sampling of the ETS Test
Collection: Tests in Microfiche (1975 through 1981).
Figure 6 shows how broadly or narrowly each test
matches up to the facets of the cubic rubric. It becomes
quite clear, here, that several tests barely tap into the
total number of possibilities for assessing language
competence. This serves as a caution, rather than a
value judgment, for test users as well as test authors. If
factors of time, money and energy loom large, then
some aspects of linguistic performance will not be
assessed. Stansfield (1982), after developing a
forty-eight cell 4 x 4 x 3 matrix, said "... test authors
must make choices as to which cells will be included
and which will be left out" (p. 237).

Figure 4

Printed Stimuli, 2D Behavior, Selected Response:
Select, from 4 different pictures, the one which is like another picture previously exposed for only 3 secs.
Select, from 4 different pictures, the one which has a functional relationship to another picture.
Select, from 4 different pictures, the one which has a relationship to a fifth picture which is analogous to the relationship between two other separate pictures.
Locate, in 30 seconds, as many designated, pictured objects as possible which have been embedded or camouflaged within each of 4 designs.

Printed Stimuli, 2D Behavior, Supplied Response:
(none)

Printed Stimuli, 3D Behavior, Selected Response:
Rearrange three or more pictured forms according to a pre-arranged, model sequence after a 5-second exposure of the model to be copied.

Printed Stimuli, 3D Behavior, Supplied Response:
Pantomime how each of ten pictured common objects is used.

Oral Stimuli, 2D Behavior, Selected Response:
(none)

Oral Stimuli, 2D Behavior, Supplied Response:
(none)

Oral Stimuli, 3D Behavior, Selected Response:
Agree or disagree, by word or gesture, with assertions that particular named animals, plants, people or things respond in certain ways.

Oral Stimuli, 3D Behavior, Supplied Response:
Complete, with one spoken word in each case, short sentences presented orally involving analogies some of which incorporate opposites.
Repeat, orally, spoken digit sequences involving from 2 to 9 digits, presented at a rate of two digits per second.
Describe, orally, each of five real common objects in terms of multiple characteristics such as name, color, shape, composition, function, parts, numerosity, simile, metaphor, or relationship to other persons, places or things.
Complete short spoken and illustrated sentences, orally, with one word which, in each case, follows a rule of grammatical usage, number, tense, or a comparative-superlative relationship.
Say the complete word for each of 30, 2 to 5 syllable words when presented orally with some syllables missing.
Pronounce real and nonsense words composed of two to seven sounds when presented orally in separated phoneme units.

Figure 5

Printed Stimuli, 2D Behavior, Selected Response:
Choose, among given printed statements, the best response to questions about the content in each of two written paragraphs.
Identify which of four printed words best completes each of ten statements.
Differentiate which of three words, if any, is misspelled in each of five examples.

Printed Stimuli, 2D Behavior, Supplied Response:
Complete, in writing, each of five sentences containing a missing word suggested by a pictorial prompt.
Write in an appropriate word or short phrase that completes each of five printed sentences in answer to a written question.
Answer, in writing, five printed questions about a given picture.

Printed Stimuli, 3D Behavior, Selected Response:
(none)

Printed Stimuli, 3D Behavior, Supplied Response:
(none)

Oral Stimuli, 2D Behavior, Selected Response:
Pick, from three printed words, the one which is spoken in each of five examples.
Select a printed YES or NO answer to each of five spoken questions about everyday objects or events.

Oral Stimuli, 2D Behavior, Supplied Response:
Write, from dictation, each of five examples containing from one to five words.
Respond, in writing, to five simple, spoken imperatives involving common, everyday words and relationships.

Oral Stimuli, 3D Behavior, Selected Response:
Verbalize which step on a four-point scale represents personal competence in four types of communication described orally.

Oral Stimuli, 3D Behavior, Supplied Response:
Reply, orally, to five simple spoken questions about everyday events.
Respond, orally, to questions about simple things or actions which are shown.
Figure 6 
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a. 


>.Q3 

O 
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CD 
C/5 


a 

w .'. 


X) 
C/5 
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C/5 




03 

C/5 
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C/5 


CD 
C/5 
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Q 

CM 


Q 

CM 


Q 

CO 


Q 

CO 




Q 

CM 


Q 

CM 


Q 

CO 


Q 

CO 


Test Reviewed

[Table: cells of the cubic rubric assessed by each test reviewed. A ★ marks an assessed cell; the column alignment of the marks is not recoverable, so they appear in reading order.]

Short Tests of Linguistic Skills (Frederickson, 1976): ★ ★ ★ ★ ★ ★
Illinois Test of Psycholinguistic Abilities (Kirk, et al., 1968): ★ ★ ★ ★ *
Spanish-English Dominance Assessment Scale (Spolsky, 1972): ★
Intermediate Reading Comprehension Test in Modern Chinese (Chou, et al., 1964): ★
Pictorial Auditory Comprehension Test (Carroll and Ho, 1959)
Flexibility Test to Measure Dominance in Spanish-English Bilinguals (Keller, 1974): ★
Ambiguous Verbal Stimulus Test to Measure Dominance in Spanish-English Bilinguals (Keller, 1974)
Navajo-English Language Dominance Interview (Spolsky, 1974): ★
Test of Language Judgement (Kahn, 1971): * ★ ★
Test of Grammatically Correct Spanish and English (de Mestan, 1976): ★
Ott Test of Oral Language: English and Spanish (Ott, 1970): ★
French Achievement Test: Language Arts; K through 5 (Delahoussaye, 1973)
French FLES Test (Barry, et al., no date): ★ ★
West Hartford Public Schools Spanish FLES Test (West Hartford, 1976): ★
English as a Second Language Assessment Battery (ESLAB) (Rivera & Lombardo, 1979): ★ ★ ★
Language Dominance Survey: Grades K-1 (Language Dominance Survey, 1974): ★ ★
Language Dominance Survey: Grades 2-4: * ★
Language Dominance Survey: Grades 5-7: ★ ★ *
Language Dominance Survey: Grades 8-12: ★ ★ * *


*Discrete classification within the rubric is complicated in these two instances: the
primary stimuli are printed pictures, but the directions on how to respond to them are
presented orally.

Deciding How Many Cells of the Cubic Rubric
Should Be Filled

Since the STLS has been shown to assess 75% of the
types of performances outlined in the proposed cubic
rubric, some might say this is sufficient. Certainly, it is
the most complete of those analyzed here. With the
addition of two more sets of tasks, it would give 100%
coverage. While this might be an ideal goal to be
reached, there are empirical studies that remain to be
done before a definitive statement can be made. There
is some evidence that if we rely on competence
measures in one cell to predict competence in another
without directly measuring it, gross errors can be made.
Moss, Cole, and Khampalikit (1982) found a correlation
of only .12 between a 2D, Selected Response, Printed
Stimuli type test and a 2D, Supplied Response, Printed
Stimuli test. In more traditional terms, this is a
correlation study between scores on an essay test and
scores on a multiple-choice language usage test. This
low correlation was for fourth graders. At the seventh
grade, the correlation went to .39 and to .47 at the tenth
grade.

Conclusion

The ease with which a multiple-choice language test
can be administered and scored is frequently the basis
for a decision that only this 2D, Selected Response,
Printed Stimuli type examination be given. However, to
make sure that a just judgment is made of a person's
language competence, more, much more is needed. At
the moment it might be best to aim at constructing a
test which touches most if not all eight cells of the
cubic rubric.

FIFTH INTERNATIONAL SYMPOSIUM ON EDUCATIONAL TESTING

Protase E. Woodford
Educational Testing Service

At ETS, since our founding in 1947, we have developed
foreign language tests. It was not until the 1960's,
however, that we attempted to assess speaking ability.
The Modern Language Association of America received
a grant from the U.S. Office of Education for
development of listening, speaking, reading and writing
tests in French, German, Italian, Russian and Spanish.
These monies became available through the NDEA.
Sputnik had frightened legislators into providing funds
for the improvement of instruction in mathematics,
science and foreign languages. Foreign language
teaching methodologies and materials underwent a
radical transformation. The model for much of this
change was the old Army Language School in
Monterey, California (now DLI). Among the consultants
for the speaking test were representatives of the
Foreign Service Institute of the U.S. Department of State.

FSI, in the late 1950's, was faced with the problem of
both describing and evaluating the face-to-face
communication skills of foreign service officers in
dozens of languages. It did not enjoy the internal
accountability system of the academic foreign language
teaching community.

The quality of its program was judged by ambassadors,
spies, heads of missions and others. If the "product" was
not of sufficient quality, Congress would soon hear of it
and the marvelous money well would soon dry up.



The FSI Proficiency Rating Scale

Level 1

Able to satisfy routine travel needs and minimum
courtesy requirements. Can ask and answer questions
on topics very familiar to him within the scope of his
very limited language experience; can understand
simple questions and statements, allowing for slowed
speech, repetition or paraphrase; speaking vocabulary
inadequate to express anything but the most
elementary needs; errors in pronunciation and grammar
are frequent, but can be understood by a native speaker
used to dealing with foreigners attempting to speak his
language. While elementary needs vary considerably
from individual to individual, any person at level 1
should be able to order a simple meal, ask for shelter or
lodging, ask and give simple directions, make
purchases, and tell time.

Level 2

Able to satisfy routine social demands and limited work
requirements. Can handle with confidence but not with
facility most social situations, including introductions
and casual conversations about current events, as well
as work, family, and autobiographical information; can
handle limited work requirements, needing help in
handling any complications or difficulties; can get the
gist of most conversations on nontechnical subjects (i.e.,
topics which require no specialized knowledge) and has
a speaking vocabulary sufficient to express himself
simply with some circumlocutions; accent, though
often quite faulty, is intelligible; can usually handle
elementary constructions quite accurately, but does not
have thorough or confident control of the grammar.

Level 3

Able to speak the language with sufficient structural
accuracy and vocabulary to participate effectively in
most formal and informal conversations on practical,
social, and professional topics. Can discuss particular
interests and special fields of competence with
reasonable ease. Comprehension is quite complete for a
normal rate of speech; vocabulary is broad enough that
he rarely has to grope for a word; accent may be
obviously foreign; control of grammar good; errors
never interfere with understanding and rarely disturb
the native speaker.

Level 4

Able to use the language fluently and accurately on all
levels normally pertinent to professional needs. Can
understand and participate in any conversation within
the range of his experience with a high degree of
fluency and precision of vocabulary; would rarely be
taken for a native speaker, but can respond
appropriately even in unfamiliar situations; errors of
pronunciation and grammar quite rare; can handle
informal interpreting from and into the language.

Level 5

Speaking proficiency equivalent to that of an educated
native speaker. Has complete fluency in the language
such that his speech on all levels is fully accepted by
educated native speakers in all of its features, including
breadth of vocabulary and idiom, colloquialisms, and
pertinent cultural references.

The foreign language tests that we had been
developing at ETS over the years were limited to skills
of reception: listening comprehension and reading.
They were objective tests (multiple choice) and
very reliable.

We know that productive skills correlate highly with
their corresponding receptive skills (speaking-listening,
reading-writing) for relatively large groups, but there are
always exceptions. A tongueless examinee might
understand Portuguese perfectly.

The objective, multiple choice tests might be extremely
reliable, but they were obviously not valid measures of
face-to-face communication skills. The Foreign Service
would sacrifice high reliability for validity. The first task
was to develop a scale or series of descriptors. The FSI
Scale ranged from 0 (no functional ability in the
language) to 5 (ability equivalent to that of an
educated native speaker).

The problem with a numerical scale, particularly for
pedagogs, is the tendency to consider the points
equidistant, as if a 3 represented the same
improvement over a 2 as a 2 over a 1. A cone, however,
illustrates better than a line the properties of the scale.
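
One way to read the cone image is sketched below in Python. The geometry is an assumption made purely for illustration: an inverted cone with its apex at level 0 and its height equal to the level, so that the volume enclosed up to a level grows with the cube of that level. The text says only that a cone describes the scale better than a line; the function name cone_volume_steps and the cubic growth are hypothetical.

```python
def cone_volume_steps(top_level: int = 5) -> list[int]:
    """Relative volume added by each level of an inverted cone (apex at 0).

    Assumption for illustration only: the volume enclosed up to height h
    is proportional to h**3, so moving from level h-1 to level h adds
    h**3 - (h - 1)**3 units.
    """
    return [h**3 - (h - 1) ** 3 for h in range(1, top_level + 1)]


steps = cone_volume_steps()
print(steps)  # [1, 7, 19, 37, 61]
```

Under this assumed geometry the step from 2 to 3 adds 19 units while the step from 1 to 2 adds only 7, which is the sense in which the points of the scale are not equidistant.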

The Foreign Service developed expectations for
attainment of different levels, by language, according to
hours of instruction.

Once the Foreign Service had the scale in place, the
next task was to develop a procedure whereby people
could be put on the scale. The LPI, or language
proficiency interview, was the result. The LPI is, in a
sense, an endless series of parallel tests, each unique to
the examinee. The interview lasts from 3 to 20 minutes,
approximately.

Structure of the Interview

Every oral interview follows the same general structure.
This general structure guides the interviewer by
directing his or her attention to certain mandatory
aspects of the test. An interview may be divided into
four phases: warm-up, level check, probes, and
wind-up. The level check and the probes take more time
than the warm-up and the wind-up. At the very lowest
levels, the limitations of the candidate's language may
be such that the four phases will be indistinguishable
from each other. At the very highest levels, neither
warm-up nor wind-up will be necessary unless the
candidate has not been speaking the language recently.

The Warm-up. The warm-up consists of social
amenities and simple conversation at a level that is very
easy for the candidate. (At the lowest levels, this may
not be possible.) There are three purposes to this phase
of the interview: (1) putting the candidate at ease; (2)
reacquainting the candidate with the language if
necessary; and (3) giving the interviewer a preliminary
indication of the candidate's level. For candidates, the
main purpose of the warm-up is to put them at ease
with the testing situation and to reintroduce them to
the language. The length of the warm-up will depend on
the circumstances; candidates who have not spoken
the language for some time may need to get back into it
gradually, while others may themselves immediately
shift the conversation to a higher level. Testers should
never skip this phase, but they may shorten it
considerably if the candidate does not seem to need it.

One good way to begin the warm-up is for the tester to
introduce himself or herself to the candidate in the
target language. Since introductions are usually learned
early in foreign language classes, it is easy for most
candidates to respond, opening the way for further
conversational exchanges.

For the interviewer, the warm-up serves the important
function of giving a preliminary indication of the
candidate's level. This preliminary indication must be
confirmed because about a third of candidates answer
questions at the level and in the style in which they are
asked. The best approach is for a tester to assume that
the preliminary indication is to be checked in the next
phase, the level check. In fact, the rest of the interview
will be devoted to ascertaining whether this preliminary
indication is accurate or not.

The Level Check. The purpose of this phase is to find
the highest level at which the candidate can sustain a
speaking performance. To find the level, the interviewer
must test the breadth and depth of the candidate's
ability in the language.

How fluent is the candidate? How well does he or she
pronounce the language? How accurate is the
grammar? How wide is the vocabulary? How correct is
the syntax? How native is the expression of ideas and
concepts in the language?

Sometimes the level indication given by the warm-up is
misleading, and the interviewer can begin the level
check too low or too high. If the test begins at too low a
level, the interviewer can simply raise the level of the
questions and begin the level check over again. If the
test begins at too high a level (this problem can be
caused by either the tester's questions or the
candidate's answers), the interviewer must bring the
level down. Starting at too high a level is to be avoided,
since bringing the level of an interview down is difficult
to do without giving the candidate a sense of failure.

In the level check, testers should check a number of
topics (both interest and noninterest areas) to see if the
candidate can perform consistently at the level in
question. Can the candidate accomplish the functions
with suitable content and accuracy? When the
candidate successfully passes the level check, his or her
performance provides a floor to the rating. The next
phase aims at finding the ceiling.

The Probes. The purpose of this phase is to make sure
that the level the interviewer has been checking is the
candidate's highest sustained level. To probe, the tester
should take the candidate above the previous level
several times in different ways: an involved question, a
situation, a conversation between two testers into
which the candidate is then drawn, etc. The interviewer
may also want the candidate to ask some questions. If
this phase has been successful, no candidate should
leave the testing room feeling that he or she has not
been tested to the limit of his or her ability.

This phase is purposely in the plural because there
should be several probes, at least three or four. Probes
should furnish clear examples of linguistic breakdown.
Sometimes the candidate actually tells the interviewer
that the limit has been reached by saying, "I don't know
how to say that in your language," or "I know what I
want to say but I can't say it." In other cases, a sharp
drop in fluency, a sudden groping for words, or a
dramatic increase in grammatical errors gives evidence
of the linguistic breakdown.

If the interviewer has carried out the level check at too
low a level, the candidate will probably be able to
respond to the probes consistently well. If this happens,
then the interviewer must recommence the process of
level check and probes and continue until the ceiling of
the candidate's proficiency is found.

While the level check gives evidence of what
candidates can do, the probes show candidates what
they cannot do. Without this phase of the interview,
candidates may appear to be more proficient than they
really are. The probes allow a tester to explain why a
candidate's speech is not at a higher level, providing
diagnostic information with specific examples.

Experienced testers learn how to interweave the level
check and the probes, so that the candidate is allowed
to return to a level where performance can be sustained
before being asked another higher-level question.
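
The interplay of level check and probes described above can be sketched as a simple floor-and-ceiling search. The following Python sketch is an illustration only, not an FSI or ETS procedure: the function rate_candidate, the callback performs_at (a stand-in for the interviewer's judgment of whether the candidate sustains a given level on one question), and the default of three probes per level are all hypothetical.

```python
from typing import Callable


def rate_candidate(
    preliminary_level: int,
    performs_at: Callable[[int], bool],
    probes_per_level: int = 3,
    top_level: int = 5,
) -> int:
    """Return the highest level the candidate sustains (hypothetical sketch)."""
    # Start from the preliminary indication given by the warm-up.
    level = max(0, min(preliminary_level, top_level))

    # Level check: if the starting point was too high, bring the level down
    # until the candidate can sustain a performance. This sets the floor.
    while level > 0 and not performs_at(level):
        level -= 1

    # Probes: take the candidate above the floor several times.
    while level < top_level:
        results = [performs_at(level + 1) for _ in range(probes_per_level)]
        if not all(results):
            break  # linguistic breakdown: the ceiling lies just above `level`
        level += 1  # probes handled consistently well: the floor was too low

    return level


# A candidate who sustains Level 2 but breaks down above it:
print(rate_candidate(1, lambda lvl: lvl <= 2))  # 2
print(rate_candidate(4, lambda lvl: lvl <= 2))  # 2
```

Both calls arrive at the same rating whether the warm-up suggested too low or too high a starting point, which mirrors the requirement that the preliminary indication be confirmed rather than trusted.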

The Wind-up. The purpose of this phase is to leave
candidates with a feeling of accomplishment after
stretching their speaking ability to the limit. It is also the
tester's last chance to check out any aspect of the
candidate's speaking ability that may still be unclear.
Normally, the wind-up should return to the highest level
that the candidate was able to sustain during the
interview. It may even be helpful, particularly at the
lowest levels, to end the test by returning briefly to a
topic discussed previously. It is, of course, always
appropriate to close by thanking candidates for the
interview.

ETS was asked by the Peace Corps in 1970 to take over
Peace Corps foreign language testing from FSI
because the Peace Corps was overtaxing FSI resources.
ETS was requested to consider alternatives to the FSI
interview. After two years of exploration, we decided
that what they had, with some refinements, suited them
very well.

The Common Yardstick

As an outgrowth of the long-standing involvement of
Educational Testing Service's language staff with the
Foreign Service Institute (FSI) proficiency rating scale
and in response to expressions of interest from the
British Council, the English Speaking Union, and
German and Japanese agencies, ETS, in June 1979,
sponsored a small conference to discuss the possibility
and desirability of establishing a "common yardstick"
(or yardsticks) to describe performance in one or more
language skills.

At the conference, descriptive scales of language
proficiency developed in various countries and by
international agencies such as the Council of Europe
were distributed. Both theoretical and practical issues
in the development and use of a single set of descriptive
scales on an international basis were discussed.
Background papers developed from prior work in this
area were presented by ETS and by British Council staff,
and other participants contributed information from the
perspective of their own organizations.

There was unanimous agreement among the
participants that development of descriptive scales for
all language skill areas should be attempted. It was also
recommended that a small working group from among
the conference participants be designated to begin
work on the scale development.

In November, 1979, a working group consisting of John
Clark and Protase Woodford of ETS; Brendan J. Carroll,
British Council; David Hicks, English Speaking Union;
and Anthony Fitzpatrick, Deutscher Volkshochschul-
Verband, met in London. The outcome of this meeting
was the preparation, in rough draft form, of descriptive
scales for oral interaction and writing and a general
outlining of scale characteristics for listening
comprehension and reading. Following the November
meeting, draft scale descriptions for "passive" listening
comprehension (excluding oral interaction) and for
reading comprehension were prepared.

On the basis of these initial activities and the positive
general response obtained, ETS requested and received
funding from the foreign language area studies research
program (U.S.O.E./D.E.) for the current project to
continue work on a common metric for language
proficiency.

Brief Description of Project Tasks

Proposed further activities for this project included the
following three tasks:

(1) Distribution of the draft scales to recognized
foreign language measurement specialists for
their critique, commentary, and any suggestions
for revision.

(2) Convening of a small group of measurement
specialists to synthesize the recommendations
of the reviewers and collaborate in the revision
of the scales. The senior British member of the
international work group would be invited to
attend this meeting to provide a summary of
similar inputs by measurement specialists in
Europe.

(3) Assuming a generally positive outcome for
activities (1) and (2), presentation and
discussion of the language assessment scales
and recommendations for future development
activities to implement the use of these scales
to executive officers of foreign language
associations, government agency representatives,
and representatives of the international
business community.



Project Outcomes 



Task (1): Review by Measurement Specialists

In December, 1980, selected foreign language
measurement experts were sent project information
and draft scales for their review and comment.
Individuals requested to participate in this review
included:

Dr. Lyle Bachman
University of Illinois

Dr. Michael Canale
Ontario Institute for Studies in Education

Dr. James Child
National Security Agency

Dr. Ray Clifford
Central Intelligence Agency and Defense Language Institute

Dr. James R. Frith
Foreign Service Institute

Dr. Helen Jorstad
University of Minnesota

Dr. Pardee Lowe
Central Intelligence Agency

Dr. Adrian Palmer
University of Utah

Dr. Howard Nostrand
University of Washington

Dr. G. Richard Tucker
Center for Applied Linguistics

Dr. Barbara Freed
University of Pennsylvania

Reviewers were asked to give (1) their appraisal of the
overall merit of the project from both psychometric and
practical standpoints, and (2) specific suggestions for
the revisions of the draft scales with the rationale for
such revision.



Task (2): Measurement Specialist Working Group
Meeting

From the original group of reviewers listed above, a
smaller working group was selected to participate in an
intensive two-day meeting at ETS to consider the
comments of all reviewers and to collaborate with ETS
staff on the revision of each of the four language scales.
The following individuals participated in the February
24-25, 1981 meeting at ETS:

Protase Woodford, Project Director
John Clark, Principal Investigator
Judith Liskin-Gasparro, Associate Examiner
Marianne Adams, Foreign Service Institute
Lyle Bachman, University of Illinois
Michael Canale, Ontario Institute for Studies in Education
James Child, National Security Agency
Ray Clifford, Defense Language Institute
Barbara Freed, University of Pennsylvania
Pardee Lowe, Central Intelligence Agency
Howard Nostrand, University of Washington

At the February 24-25 meeting, the participants
discussed in detail the following issues.

1. Skills Represented by Scales

It was suggested that the traditional four skills
(listening, speaking, reading and writing) be modified
to include reading, writing and "pure" listening
comprehension as discrete, measurable skills, and that
"oral interaction" be used to replace "speaking" because
of the listening skill required in real-life speech contexts.

2. The Number of Scale Divisions

The scales reviewed in connection with the project
came from a variety of U.S., British, and European
sources. The 0-5 Foreign Service scale was the most
familiar to the participants. The utility of the various
scales was discussed as well as a proposed 8-level
scale (0-7) for oral interaction.

3. FSI Scale/Oral Interaction Scale Comparison

The relationship of the proposed 0-7 oral interaction
scale to the FSI 0-5 (with +'s) scale was considered.

4. Scale Progression

Participants were also asked to consider whether the
proposed scales provided for a smooth progression
from one level to the next and whether any pair of level
descriptions was too close to allow for a meaningful
distinction between levels.

5. Intra-level Consistency

Participants were asked to consider whether the
descriptive statements within each level would apply to
most persons within the ability group, i.e., to make sure
that there would be no descriptions of tasks or
behaviors "too easy" or "too difficult" for people within
the level.

6. Interscale Comparability

Discussion centered on the degree to which the scales
for the four skills were consistent with regard to detail
of description.



7. Individual Scale Aspects

Participants were asked to rank each of the four draft
scales on the following criteria:

A. "understandability,"

B. "real-life referencing,"

C. ease and straightforwardness of use for rating
examinee performance,

D. priority for development.

The participants were also asked to consider the scales
presented to them in light of the "ideal" scale, that is,
one that would include all of the features of oral
interaction that they considered important.

As the meeting developed, it became apparent that the
task at hand was extremely complex. Consequently, it
was decided that a major part of the effort would be
devoted to oral interaction.

The following were major outcomes of the meeting:

(1) A commitment to some form of the 0-5 government
scale. The deliberations of the group demonstrated that
all of the members were in some sense basing their
reactions to the draft scales on the relationships of
these newer scales to the government scale developed
by the FSI. Since the government scale is relatively
better known and since it has a long and respected
history, it seemed most reasonable to begin with it as a
base, making adjustments to it that would not alter the
accepted understanding of the significance of Level 1
proficiency, Level 2 proficiency, etc.

(2) The realization that no scale currently in existence or
under consideration does as complete a job of
evaluating oral proficiency as the participants in the
February meeting would like. Particular concern was
focused on such aspects of language ability as register,
cultural sensitivity, and, in general, the relationship
between linguistic ability and the larger area of
interpersonal communication. While these issues arise
mostly at the upper proficiency levels, there are some
languages for which they emerge as low on the
government scale as Level 2. Time was also spent
discussing and coming to a common understanding of
the term "fluency."

(3) The decision that further work is most essential at
the 0-2 range. This is the area in which most
second-language speakers can expect to fall after
taking advantage of the range of academic courses and
extracurricular activities usually offered in secondary
schools and colleges. Level 3 proficiency is usually
attained only after extended residence in a country
where the target language is spoken and/or through
intensive or immersion-type language study. It was
recommended that Levels 0, 1, and perhaps 2 be further
subdivided to provide finer distinctions.

Given the outcomes of the February meeting, it became
apparent that further refinement of the scale
descriptions was needed before proceeding to the
expected next step of the project, the convening of a
meeting with executive officers of foreign language
associations, government agency representatives, and
representatives of the international business
community. The intended focus of a meeting with these
"user groups" was planned to be a presentation of the
revised scales and a discussion of whether and to what
extent the scales met their specific needs in the area of
language proficiency evaluation.

Since the group at the February 24-25 meeting had
recommended that further work be done on the lower
levels of the oral interaction scale and, further, that
work on the other skills be postponed while efforts were
focused on the oral interaction scale, it was decided not
to hold the meeting for language association officers,
government agency representatives, and representatives
of the international business community as
planned. Instead, further work was done at ETS on the
expansion of the lower end of the oral interaction scale.

Task (3): Final Scale Revisions

On October 6, 1981, the final meeting of the Common
Yardstick Project was held at the CIA Language School,
hosted by Dr. Pardee Lowe. The purpose of the meeting
was to discuss and, if possible, reach consensus on the
revisions to the oral interaction scale and to agree on
future plans. The participants at the meeting were as
follows: Protase E. Woodford, Judith E. Liskin-Gasparro,
and Ihor Vynnytsky from ETS; Professor Barbara Freed,
Assistant Dean for Languages, University of
Pennsylvania; Dr. Ray Clifford, Academic Dean, Defense
Language Institute; Dr. Pardee Lowe, CIA Language
School; Dr. Yvonne Escola, teacher of French in
Montgomery County, Maryland and program officer,
National Endowment for the Humanities; and Dr. John
L.D. Clark, Center for Applied Linguistics.

The discussion at the meeting focused on the needs of
the government language schools and the academic
community in the area of the evaluation of oral
proficiency. The expanded lower end of the government
scale was presented for discussion, and both Dr. Clifford
and Dr. Escola agreed that it would provide valuable
information for students as well as teachers of language
programs. At the CIA Language School, language
testers often offer finer descriptive distinctions, beyond
the official ratings, in their evaluations of student
examinees. Dr. Lowe reported that the informal
descriptions were very similar to the expanded
"intra-level" descriptions prepared by ETS. This
congruence in the independently developed
descriptions was an encouraging sign, and it was
agreed that further work in this area would benefit from
the experience of the CIA Language School.

The group discovered a second area of congruence
between academic and government language
assessment needs in the discussion of the value of a
bilevel system of oral proficiency assessment. It was
agreed that for other than a few very specialized uses,
there is little need to discriminate between levels of
proficiency at the 3 to 5 range. Most academic and
professional needs will be satisfied by proficiency at the
3 level or lower, so it is at this lower end of the scale
that most attention needs to be focused. The group
agreed that the following labels for ranges of
proficiency represent realistic descriptions:

0 ... elementary

1 ... intermediate

2 ... advanced

3 ... superior
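
The range labels above amount to a small lookup in which the upper levels are collapsed. The Python sketch below is illustrative only; the names RANGE_LABELS and range_label are hypothetical, and it handles whole-number levels only (the "+" ratings of the government scale are left out).

```python
# Labels agreed at the meeting for ranges of the 0-5 government scale.
RANGE_LABELS = {0: "elementary", 1: "intermediate", 2: "advanced", 3: "superior"}


def range_label(level: int) -> str:
    """Return the range label for a whole-number government-scale level (0-5)."""
    if not 0 <= level <= 5:
        raise ValueError("level must be between 0 and 5")
    # Levels 3, 4 and 5 are not distinguished: all are reported as "superior".
    return RANGE_LABELS[min(level, 3)]


print([range_label(n) for n in range(6)])
```

Collapsing everything from 3 upward into one label reflects the group's view that finer discrimination in the 3 to 5 range is rarely needed.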



In summary, the major outcomes of the meeting were:

(1) Agreement on the usefulness of the expanded
descriptions at Levels 0 and 1. It was agreed that these
descriptions were definitely "on the right track" and
that further work on them would be a logical and
valuable step.

(2) Agreement on the usefulness of a bilevel system
according to which further development efforts would
be concentrated in the 0-3 range. Individuals above
Level 3 would be designated as "superior." If a precise
level were desired, it could be provided via the
traditional face-to-face interview.

(3) Agreement that coordination of efforts among the
various agencies concerned with language proficiency
testing is a major concern that must be addressed. As of
this date, a major cooperative venture, stemming from
the Common Yardstick Project, has been launched that
includes ACTFL, ETS, and the CIA Language School.

In addition, an invitational conference on language
proficiency testing was hosted by Ray Clifford and the
Defense Language Institute November 30-December 1,
1981 in order to discuss the current needs and projects
of agencies inside and outside the government and to
decide on areas of future development.

Summary and Further Planned Development Work 

Although the final stage of the project, i.e., the meeting
with representatives of language associations,
government agencies, and the international business
community, did not take place as proposed, the new
direction undertaken by the project will serve to build a
stronger foundation to serve these constituencies
better in the long run. The contributions of the language
professionals and linguists were valuable in defining
areas of strength and weakness in the existing scales,
and especially in recommending that new efforts be
based on the government definitions and scale. The
decision to concentrate on the lower end of the oral
interaction scale resulted in the creation of intermediate
working definitions. For several years, language
professionals in academe have recognized that the
absence of these expanded descriptions has severely
limited the applicability of the government oral
interaction scale to college and high school students.

Further development work, stemming from the
Common Yardstick project, is already underway by
ACTFL, ETS, and the CIA Language School. After the
October meeting, Mr. Woodford turned over to ACTFL
the expanded descriptions of oral proficiency for levels 0
and 1. ACTFL, which is working on the development of
proficiency levels as goals of instruction under a grant
from the International Research and Studies of the U.S.
Department of Education, in turn asked Dr. Lowe of the
CIA Language School to investigate the validity and
accuracy of the ETS descriptions. Dr. Lowe, assisted by
financing and professional collaboration from ETS,
designed a research project to determine (1) whether
the expanded "intra-level" descriptions correspond to
real-life language use and (2) whether the intra-level
descriptions and independent raters will rank a group of
tapes known to be within a given level in the same
order. The scale with the expanded lower end will be
taught to college faculty members in Spanish and
French at the workshop sponsored by ACTFL and
conducted by ETS under a grant to ACTFL by the U.S.
Department of Education.
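
The second research question, whether the intra-level descriptions and
independent raters place a set of tapes in the same order, can be
illustrated with a rank-agreement statistic. The sketch below is not
drawn from the project itself: it assumes Spearman's rank correlation
as the agreement measure, and the function name, the five tapes, and
both rankings are hypothetical.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation for two rankings that contain no ties."""
    if len(rank_a) != len(rank_b):
        raise ValueError("rankings must have the same length")
    n = len(rank_a)
    if n < 2:
        raise ValueError("need at least two ranked items")
    # Sum of squared differences between the two ranks given to each tape.
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - (6 * d_squared) / (n * (n ** 2 - 1))


# Hypothetical data: five taped interviews within one level,
# ranked from weakest (1) to strongest (5).
description_order = [1, 2, 3, 4, 5]  # order implied by the intra-level descriptions
rater_order = [1, 3, 2, 4, 5]        # order assigned by an independent rater

rho = spearman_rho(description_order, rater_order)
print(f"rank agreement (Spearman rho): {rho:.2f}")
```

A value near 1 would indicate that the descriptions and the rater order
the tapes alike; a value near 0 or below would indicate that they do not.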

For the long range, development work similar to that
which has been accomplished for oral interaction might
be undertaken for the other language skills. Although
further development work beyond the scale-definition
and review stage would require additional financial
support and would also, of course, depend on the
psychometric appropriateness and anticipated practical
utility of the final scale descriptions, a fairly large-scale
test development/validation project could be
envisioned as a possible outcome of the initial work.
This larger study could include each of the following
activities:

(1) Development of comprehensive measures of
language skills encompassing and operationally
defining the descriptive scales. These would be very
exhaustive and lengthy direct measures of each of the
skills in question, requiring perhaps two full days of
testing on the part of each examinee. It is recognized
that these criterion tests would not be practical for
regular measurement purposes, but would be used as
comprehensive "benchmark" instruments exemplifying
the scale descriptions and against which presently
available, more easily administered tests (or
smaller-scope tests yet to be developed) could be
compared and validated.

(2) Development of validation measures external to
both the large-scale "benchmark" tests and any
smaller-scope tests. These external measures would be
expected to include both examinee self-appraisal and
"second-party" (e.g., classroom teacher, work
supervisor) evaluation of the examinee's proficiency in
the language skill areas at issue. These evaluations
could take the form both of (a) direct utilization of the
common yardstick scales (i.e., examinees and
second-party observers would be asked to rate the
performance vis-a-vis the common yardstick
descriptors) and (b) use of more detailed and more
"atomistic" descriptions of particular language-use
functions (e.g., "say the days of the week," "buy clothes
in a department store," "talk about your favorite hobby
at some length, using appropriate vocabulary"), which
would be rated on a dichotomous (can do/cannot do)
basis.

(3) Large-scale administration of the comprehensive
"benchmark" measures, smaller-scope measures, and
external criterion measures to a large and varied group
of examinees, for purposes of both construct/concurrent
validation of the instruments in question
and establishment of equating data relating examinee
performance on the smaller-scope tests to both the
"benchmark" test results and to the common yardstick
descriptors.



The exact nature and operational details of the activities
outlined in 1-3 above would, of course, have to be
spelled out much more comprehensively at a later date.
The intent at this point is simply to give a general
overview of the kinds of development work that would
seem to be logical and, we hope, practically useful in
extensions of the initial development of the common
yardstick descriptors.

Conclusion

At the time that the current study was proposed, the
idea of a "common yardstick" or uniform descriptors of
language proficiency was being considered only within
a restricted population of measurement specialists and
government-connected linguists. The "yardstick"
activities themselves and the reports on the yardstick to
major foreign language education constituencies* have
created extraordinary interest in the project across
academic levels. It is obvious now that the original
scope of this project was far too broad. A result of the
deliberations at the February meeting of the working
group was a narrowed focus on a scale for one of the
skill areas, the one considered of highest priority: oral
interaction. The well-known and respected scale used
by the federal government has seen limited use in the
academic context primarily because it provides too little
discrimination at the lower end, 0.0 to 2.0. It is precisely
at the lower end of the scale where there is greatest
need for evaluation of language skills in schools and
colleges.

The proposed expanded scale is a product of the
working group's efforts subsequent to the February
1981 meeting and during the October 1981 meeting in
Arlington, Virginia to refine and plan next steps.

The work accomplished through this project has served
and will continue to serve a number of related projects.
The working group members* are actively involved in
continued dissemination of the draft scale to various
foreign language constituencies. The expanded oral
interaction scale is currently undergoing validation
under an ACTFL-sponsored project and will, if proved
valid, likely become the "yardstick" for describing the
ability of American students to function in a real-life
communication situation. Further work on the
"yardstick," including further development and
refinement of the existing draft scales for "pure"
listening comprehension, reading and writing, is
planned. Support will be sought from a variety of
sources. Training programs for high school and college
foreign language teachers are scheduled for 1982.
These training programs will focus on the evaluation of
students' ability to understand and speak a foreign
language in a real-life context. The scale considered for
use is the expanded/revised oral-interaction scale
developed under the current Common Yardstick
project. Among the sponsors of the training are:

The American Council on the Teaching of Foreign
Languages (ACTFL),

The Northeast Conference on the Teaching of
Foreign Languages,

Educational Testing Service,

Vassar College.



*Clark, Freed, Liskin-Gasparro, Lowe, Woodford have reported on the "yardstick" to such groups as Southern Conference on Language Teaching,
Modern Language Association of America, Pennsylvania MLA, Florida Foreign Language Teacher's Association.

BEYOND MEASUREMENT: USING EVALUATION TO IMPROVE FOREIGN
LANGUAGE EDUCATION



Phyllis L Hagel Jacobson 



Present Crisis of Confidence 



Background. Foreign language programs have
increasingly come under attack in recent years. The
following statements indicate the gravity of the present
situation facing the profession: "Americans'
incompetence in foreign languages is nothing short of
scandalous, and it is becoming worse"; "Our schools
graduate a large majority of students whose knowledge
and vision stops at the American shoreline, whose
approach to international affairs is provincial, and
whose heads have been filled with astonishing
misinformation"; and "All of this underscores that,
crucial to any improvement of an intolerable situation,
there is an urgent need for better trained teachers and
for extensive retraining of those already serving in the
nation's classrooms, particularly in view of widespread
expert agreement that the decline of foreign language
enrollments is in large measure a response to poor
instruction." 1

What conditions have prompted public commissions
and other responsible critics to level these charges of
lack of competence on the part of teachers and lack of
achievement on the part of students? One answer is
that the foreign language profession lacks convincing
evidence on the effectiveness of foreign language
programs as a whole, and of commonly used teaching
methodologies in particular. The evidence of student
achievement generally provided by foreign language
educators is not expressed in the measurable,
standardized terms commonly deemed acceptable to
researchers, professional commissions and/or other
measurement professionals. As Dale Lange noted in a
letter to Representative Paul Simon of Illinois on 30
June 1980, ". . . the effectiveness of second language
training at the college and university level is as
questionable as that of secondary school language
training. There is no evidence in the literature of second
language learning of such effectiveness; it is only
assumed by those in higher education." Few
methodologically rigorous studies have been done on
the effectiveness of foreign language programs as a
whole, or on the effectiveness of particular
methodologies. Those studies which have appeared,
such as those done in Albuquerque on intensive
elementary school Spanish language instruction, and in
Plattsburgh (NY) on the French immersion program at
the elementary school level, generally pertain to
high-concentration, intensive programs, rather than to
the typical one-period-per-day elective type of program
most commonly found in the nation's schools.

A second answer is that empirical evidence of the lack
of student achievement is obtainable by the public.
One example of this type of
evaluative assessment is cited by Paul Simon:

"It seemed obvious at the outset that the B.A.
holder in foreign languages would be our prime
candidate. However, it soon became apparent
that most foreign language majors with a B.A.
degree are not fluent enough . . . They could not
sit at a negotiation table to discuss contractual
arrangements or the purchase or sale of a given
merchandise . . . The other day I had a student
enroll in our MBA International Program. Her
major was French. When I learned this, I asked
her a very simple question in French, namely,
that surely she must speak French well. The
answer coming back to me in English was, 'I
see I have to pursue my study of French some
more.'" 3

Incidents of this type provide critics with empirical
evidence of lack of student achievement and, by
implication, of lack of teacher competence.

Some critics also accept hearsay as empirical
"evidence" of the lack of demonstrable teacher
competence, as illustrated by this correspondence
between Paul Simon and Harry Reinert: "Responding to
my letter of 19 August 1980, in which I objected that
'teachers at these levels do not need additional
training . . . foreign language teachers have been trained
and retrained for more than two decades,' Simon wrote
in a personal letter to me on 25 August 1980: 'many
foreign language professionals tell me that language
teachers do need training in more innovative methods
than those in which they were classically trained'"
(Reinert, p. 249).



Availability of Evaluative Information. What factors have
constrained the availability of valid and convincing
program evaluative information to forestall or counter
these criticisms? First, according to the second chapter
of Simon's book, programs have vacillated in their
orientation in response to external social and political
pressures. Lack of consistent goals has resulted in a
lack of reliable program focus and hence a lack of


1 Strength Through Wisdom: A Critique of U.S. Capability (Washington, D.C., 1980), pp. 5-8.

2 Cited in Harry Reinert, "Caveat Emptor: The President's Commission," Modern Language Journal 65 (1981), p. 250.

3 Simon, The Tongue-Tied American (New York: Continuum, 1980), p. 124.



standards/goals against which program implementation
and student achievement can be measured.

Second, for many years, "evaluation" in the language
field has been synonymous with "testing." As Omaggio
states, "Evaluation is often thought of in the very
narrow sense of 'tests' and 'grades.'" 4 She also notes
that teachers and administrators have not fully
understood the broader focus of program evaluation
beyond the measurement of achievement via testing:
"TenBrink asked preservice and inservice teachers to
answer the question, 'What comes to your mind when
you hear the word "evaluation"?' The most frequently
occurring responses were: Preservice teachers: test,
grades, achievement, unfair, judgment; Inservice
teachers: tests, measurements, grades, accountability,
invasion of privacy" (p. 236). While the periodical
literature carries numerous articles on focused testing
strategies, few articles deal with applying broader
program evaluation concepts to foreign language
programs. Recently, however, monographs have begun
to treat this topic. 5

Even within the narrower domain of testing, there is still
no consensus within the profession as to suitable,
reliable and valid testing methods. For example, each of
the suggested standard teacher proficiency measures in
the field has drawbacks serious enough to make it
unacceptable as a valid measure to a significant
segment of the profession (Valette and Linder, p. 202).
As Clark points out, ". . . Even the FSI interview falls
somewhat short of being an absolutely face-valid
measure of communicative proficiency." 6

Third, foreign language educators have not been in the
forefront of the "back to basics" movement which has
forced a closer look at the major objectives, the
instructional sequences and the evaluation
methodology of programs as a whole. Although there
have been recent publications on this topic, 7 the
profession as a whole has not organized to work either
at the local or the state level to define the priority goals
to be achieved within a program, even at a "minimum
competency" level. Indeed, as perhaps a continuing
reaction to events which occurred several decades ago,
foreign language educators still resist efforts to develop
national or state criteria for basic skills in second
language education. As Reinert comments:



Such unequivocal imperatives regarding
pedagogical priorities in foreign language
study should alert members of the profession
to some possibly undesirable consequences of
the Commission's recommendations. The
monolithic approach found in the report has
the familiar ring of the strict audiolingualism of
twenty years ago: veterans within the
profession should be able to remember vividly
the enormous pressure which was applied to
assure national conformity to a single approach
to teaching foreign languages (p. 250).

Concerted efforts have not been made to provide local
or state needs assessment data which would allow
professionals to determine themselves the objectives
against which their programs could be measured.

Fourth, the profession has seen a proliferation of new
programs and approaches (e.g., FLES, mini-courses,
vocationally oriented courses, courses for "communicative
competence," etc.) which have, in many cases,
been hastily adopted and just as quickly discarded
before reliable information could be obtained as to the
worth and/or effectiveness of such programs and
methods.

Fifth, most foreign language educators lack professional
training in the area of evaluation. Few evaluators have
the linguistic background which would allow them to
work effectively with language educators, particularly
in the crucial area of test development. The lack of
trained personnel is reflected in the dearth of
high-quality program evaluation studies. As Omaggio
confirms, "Although there is an abundance of literature
pertaining to program evaluation in general, the foreign
language community has been strangely quiet about
the problem, which leads one to suspect that
evaluations are not being carried out with any regularity
or in any systematic fashion" (p. 240). Indeed, a survey
conducted in 1978 under the auspices of ERIC-CAL on
program evaluation in foreign language instruction
concluded that "with some notable exceptions, there
appears to be a low level of interest and activity in
program evaluation of foreign language instruction in
the United States, with the single general exception of
the periodic (and too often pro forma) visits of the
regional accreditation associations" (Omaggio, p. 243).



4 Alice Omaggio, "Looking at the Results," Building on Experience-Building for Success, ed. June Phillips (Skokie, IL: National Textbook Co., 1979),
p. 237.

5 For example, see Gilbert Jarvis & Shirley Adams, Evaluating a Second Language Program (Arlington, VA: Center for Applied Linguistics, 1979);
and Rebecca Valette & Cathy Linder, "Measuring the Variables and Testing the Outcomes," Building on Experience-Building for Success, ed. June
Phillips (Skokie, IL: National Textbook Co., 1979).

6 John L.D. Clark, "Measurement Implications of Recent Trends in Foreign Language Teaching," Foreign Language Education: A Reappraisal, ed.
Dale Lange (Skokie, IL: National Textbook Co., 1972), p. 223.

7 For example, see Renate A. Schulz, "Back to Basics in the Foreign Language Classroom?" Foreign Language Annals 11 (1978), pp. 647-55; the
report on the Central States Conference, "Alternatives, Basics, and Competencies in Foreign Language Instruction," Modern Language Journal
(1979), pp. 364-66; and Teaching the Basics in the Foreign Language Classroom: Options and Strategies, ed. David P. Benseler (Skokie, IL:
National Textbook Co., 1979).

Finally, the profession itself continues to be divided
about the ultimate goals of language study. Some
members hold to a rationale which rests mainly on
aesthetic and humanistic grounds, a rationale
influenced by university- and college-level educators
for whom the study of literature is viewed as the
ultimate purpose of study at the beginning
language-learning level. Others, generally at the
elementary and secondary school level, view language
learning primarily as a means for communicating with
and understanding other cultures. Still other educators
view beginning language learning as a tool to be used
to further international business and/or diplomacy. As
Reinert observes:

. . . if the Commission did give us everything the
professional associations requested, this
should properly be seen as an indication of the
terrible price the profession must pay for its
lack of unity and direction . . . Parochial
concerns of the component members of JNCL
have prevented it from speaking with authority.
One result of this was that the Commission at
its regional hearings received mixed and even
competing messages from different segments
of the foreign language profession. We must
conclude, therefore, that the shortcomings
with respect to foreign languages in the
Commission's report are at least in part due to
the profession's unwillingness to settle internal
differences in order to advance the welfare of
the profession as a whole (p. 252).

These divisions within the profession further result in a 
core of educators who resist being held accountable for 
student achievement based on goals which they 
personally do not support. 

Ultimate Effects. What are the ultimate effects of this 
lack of evaluative information on the profession and on 
foreign language programs? First and foremost it has 
led to the downgrading of language study as a national 
priority in and of itself. As Reinert's analysis of the
President's Commission report points out, ". . . Equally
clear is that far from being the 'best thing that has ever
happened to foreign languages,' the report's
recommendations actually threaten the profession with
loss of identity by reducing language learning to a
minor supportive role within international studies" (p.
252).

It has also taken the impetus and the responsibility for
professional investigation, evaluation and improvement
out of the hands of the profession itself. House Bill HR
7580, "introduced in the House of Representatives on
13 June 1980, by Representative Paul Simon . . . would
provide for a 'survey of the effectiveness of FL training
in grades 7 through 12, and in community colleges,'
with an emphasis on '. . . the desirability and costs of
additional training for FL teachers in grades 7 through
12, and in community colleges' (Sect. 3;2C)" (p. 252).
Furthermore, the President's Commission report
proposes as one of its recommendations the creation
and funding of a "National Criteria and Assessment
Program" to "develop foreign language proficiency
tests, and report on, monitor, and assess foreign
language teaching in the U.S. (p. 13) . . . This assessment
program should be administered through competitive
contract by a nationally recognized professional
association or agency with special competence in
foreign language studies" (p. 42). And, finally, the
Commission further recommended the creation of an
Advisory Council on Foreign Languages and
International Studies in each state: "These councils
should be comprised of approximately 10 persons,
including teachers from all levels, representatives from
business and labor and from major ethnic communities,
lay people and at least one student . . . The Councils
should advise the governor, the chief education officer
and the state legislative body on Foreign Language and
International studies in the state, and make
recommendations towards their improvement" (p. 43). It
is significant of the current state of affairs that this
policy-recommending body has no foreign language
teacher representation specified by name; nor must the
"National Criteria and Assessment Program" necessarily
involve a recognized foreign language organization.

Finally, it has led to the further erosion of the public
credibility both of the profession and of language
programs in general. Even as distinguished an educator
as James B. Conant has suggested that anything less
than four years of a foreign language before college is
worse than nothing (Simon, p. 94). Simon also quotes
Wall Street Journalist Vermont Royster as observing
that "students going to college are deficient in language
skills. Some still are when they leave. That is indeed
appalling, and it's long past time we raised some hell
about it" (p. 97).

Utility of Evaluation for Foreign Language Education 

How can evaluation aid the profession in countering
these negative perceptions and in returning control of
the future directions of foreign language study to the
hands of professional educators?

The term "evaluation" has had many definitions within 
the growing body of evaluation literature. Current 
emphasis in the field focuses on evaluation as a process 



8 See also David P. Benseler, "The American Language Association: Toward New Strength, Visibility, and Effectiveness as a Profession," Our
Profession: Present Status and Future Directions, ed. Thomas H. Geno (Middlebury, VT: Northeast Conference, 1980), pp. 143-56.

9 At the present time, ACTFL has been taking the beginning steps towards a national criteria and assessment program; see also Regina H. Paul,
"Needed: Stepladders of Foreign Language Learning," Foreign Language Annals 14 (1981), pp. 379-84.

10 Both Omaggio and Jarvis & Adams provide illustrative discussions of this point.



which provides information useful for decision making.
Alkin makes a needed distinction between "evaluation"
and "research": "On the one hand, there are studies
designed primarily to add to the body of knowledge
(research), on the other, those studies designed
primarily to provide information for decision making
(evaluation). And the two functions are separate and
distinct." 11 Decisions about foreign language programs
and staff training are being formulated today by those
outside the profession; the urgent first step in
regaining control for the profession is to be able to
provide valid data to the decision makers.

The literature of evaluation provides several
prescriptive models for the techniques of program
assessment, for the collection of data and for its
subsequent analysis; the reader needing training in
these areas is encouraged to consult these references.
There are, however, certain elements common to most
evaluation designs which, if carried out properly, will
provide useful and reliable information about programs
and staff. Each of these general evaluation activities will
be discussed in relation to how they can help foreign
language programs reestablish their credibility and
provide convincing data to decision makers.



Evaluability Assessment. Evaluability assessment is
defined as a process for determining in advance the
likelihood of an evaluation's success. It consists of
two stages: first, an examination of program
characteristics and second, an evaluation feasibility
analysis. Many foreign language "programs" are in fact
not comprehensive enough to support a full-scale
program evaluation and need to modify their approach
to evaluation to reflect local conditions. Other programs
lack qualified staff, as seen in this example provided by
Simon: "Some years ago I received a letter from a
mother whose son had taken two years of high school
Spanish and received straight A's. When he entered the
University of Illinois, he was not admitted to an
advanced class. They advised him that he had learned
almost nothing in those two years of Spanish. When I
chatted with the principal of that small high school, he
told me that the teacher had only limited knowledge of
Spanish. He was primarily a mathematics teacher" (p.
123). Such programs, whose lack of quality and
effectiveness is evident, are unlikely candidates for
full-scale evaluations and would be weeded out by an
evaluability assessment. On the other hand, a program
which is deemed feasible for further study can be
assured of the utility of continuing with ongoing
evaluation activities.



Needs Assessment. Needs assessment is defined as the
determination of the difference between the desired
status (of program, of learner, etc.) and the current
status (Popham, p. 65). Quality needs assessment data
can be used by foreign language programs and staff to:

• provide for community input as to what is seen
as important in language study;

• develop defensible goals for language study
which are appropriate to local conditions,
philosophy and resources;

• determine the present status of student
achievement relative to program goals;

• determine the educational needs of students in
order to achieve program goals;

• design instructional sequences based directly
on student needs;

• determine priorities for test development;

• determine requisite teacher competencies;

• provide evidence of what training needs of
teachers are based on these identified teacher
competencies;

• provide valid data to the public as to the
importance of language study to the nation;

• provide valid data to the public as to the need
for making language study an integral part of,
rather than an optional adjunct to, international
studies.

Needs assessment data, when categorized and
aggregated on a national basis, can provide a defensible
focal point for the joint efforts of the professional
organizations. Unfortunately, needs assessment data
has, to date, been notably lacking and thus has allowed
critics to level charges unsubstantiated by hard data,
such as those quoted in both the President's
Commission report and Simon's book. Needs
assessment data can also provide a defensible basis for
making decisions on the design of a program which will
be based on the expressed wants of the local
community, the needs of the students and the
competencies of the staff.
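
The definition of needs assessment given above, the difference between
desired status and current status, can be made concrete with a small
calculation. The sketch below is illustrative only: the skill names,
the proficiency figures, and the function name are hypothetical and do
not come from any study cited here.

```python
def needs_gaps(desired, current):
    """Return (skill, gap) pairs, where gap = desired status - current status.

    Pairs are sorted so that the largest gap, i.e. the greatest need,
    comes first. A skill missing from `current` is treated as 0.0.
    """
    gaps = {skill: desired[skill] - current.get(skill, 0.0) for skill in desired}
    return sorted(gaps.items(), key=lambda item: item[1], reverse=True)


# Hypothetical program goals and measured status on a 0-5 proficiency scale.
desired_levels = {"listening": 2.0, "speaking": 2.0, "reading": 1.5, "writing": 1.0}
current_levels = {"listening": 1.5, "speaking": 0.5, "reading": 1.0, "writing": 1.0}

priorities = needs_gaps(desired_levels, current_levels)
for skill, gap in priorities:
    print(f"{skill}: gap of {gap:.1f} levels")
```

Ordering the gaps in this way is one simple means of setting priorities
for instruction and test development; a real needs assessment would also
weigh community input and available resources.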



11 Marvin C. Alkin, Richard Daillak and Peter White, Using Evaluations: Does Evaluation Make a Difference? (Beverly Hills: Sage, 1979), p. 13.

12 Bruce Thompson, Jean King and Ellen Pechman, "Evaluation Utilization: A Bibliography," Unpublished paper (New Orleans: Tulane University,
1971).



Implementation Evaluation. Implementation evaluation
is defined as an assessment of the actual status of
program activities as compared with the original
program plan. Such a procedure permits an accurate
description of exactly what the program being
evaluated consists of in reality, rather than on paper. It
provides a common frame of reference to all parties
concerned with the evaluation and allows for the
explanation of the differences between what was
planned and what was actually done. An
implementation evaluation can provide:

• a common definition within the profession as
to the characteristics of different types of
foreign language programs;

• evidence that a program claiming to be of a
specific type (i.e., "immersion," "individualized,"
"audiolingual") is in fact a program of that
specific type;

• evidence that what is being measured by tests
and other observations represents the learning
environment which is in fact being fostered
through a program's instructional and related
activities.

Implementation evaluation data can allow educators to
establish program categories and descriptions which
will remain consistent across the nation. The
implementation evaluation process will enable the
profession to use a common set of terminology and
thus avoid some of the problems currently being
encountered by the related field of bilingual education,
which has seen a multiplicity of program types,
purposes and methodologies all subjected to
mislabeling under the general heading of "bilingual
education." Implementation evaluation data will also
allow for the setting up of comparative program studies,
since educators can be certain that programs which are
in fact similar in design, orientation and methodology
are selected for comparison purposes.

Formative Evaluation. Formative evaluation looks at a
program during its early stages while program elements
can still be changed, if desired, in response to local
conditions. This stage can last from the first day of
program implementation until such time as no further
modifications can be made and the program stands
intact as an entity. As Jarvis and Adams explain, "The
purpose of formative evaluation is to improve the
instruction. It asks, in effect, about the current status of
the program so that it can be made better. It is
evaluation that is carried out during the development,
implementation and operation of the program" (p. 6).

Formative evaluation data are sorely needed by (and
lacking in) most foreign language programs.

Such data can be used to:

• examine the effectiveness of current instructional
materials towards meeting the program's
instructional goals;

• examine the match between program goals
and learner activities;

• look at the match between testing content/
strategies and learner instructional activities;

• develop criteria for program self-assessment;

• investigate the effectiveness of various
teaching methodologies in achieving program
goals;

• examine the relationship between teaching
methodology and program conditions (i.e.,
which teaching methodologies are best suited
to the various types of programs?);

• identify the most effective components to the
"foreign language teacher style";

• look at "innovative" programs in their early
years of implementation in order to identify the
most promising elements and to increase their
effectiveness (John Rassias' program at
Dartmouth would be an example of such a
program).

Rigorous formative evaluation procedures will provide
defensible evidence leading to increased credibility for
the profession's efforts to improve instructional
programs. Demonstrated program successes can be
more quickly and efficiently communicated, both to the
profession and to the general public. Ongoing formative
evaluation as an integral part of a foreign language
program is a sine qua non for providing valid data to
decision makers. The importance of formative
evaluation for the profession is also stressed by Jarvis
and Adams: "There is a need for a more formative
evaluation in language programs. Rarely do we initiate
evaluation even for the purpose of improving our
programs. Even in the light of the very heavy loads of
most instructors, evaluation is still far too important an
activity to be so neglected" (p. 7).

Summative Evaluation. Summative evaluation focuses
on determining the merits of a program at its
completion. As Jarvis and Adams summarize,
"Summative evaluation is terminal evaluation of a
program that is already operational. Its purpose is to
make judgments about a program's worth . . . .
Ultimately, summative evaluation is tied to decisions
about support and continuation of a program" (p. 6).

Jarvis and Adams go on to draw some useful
distinctions between "formative" and "summative"
evaluation:

In a formative evaluation it is important to
obtain day to day feedback on specific aspects
of the instruction. This information is, however,
often fragmentary and of limited use in a
summative evaluation. Conversely, an
assessment of the impact of a new program on
the image of language study in a particular
school may be very important in a summative
evaluation but is inappropriate for a formative
purpose . . . Formative and summative evaluators
often do not 'behave' in the same way. A
formative evaluator's procedures may be more
partisan than a summative evaluator's
approach. The summative evaluator must be
objective and circumspect. A formative
evaluator can use shortcuts, small samples and
intuition in an effort to improve the program
(pp. 6-7).

Summative evaluation data are the most authoritative
and defensible information the profession can provide
to any interested parties. Such data can be used to:

• provide replicable outcome data for all types of
foreign language programs;

• provide comparative program outcome data for
competing program types;

• determine effective instructional materials and
teaching methodologies;

• identify exemplary programs worthy of
dissemination and/or replication;

• provide program cost-effectiveness data.

Improving the Utilization of Evaluation Information

Although data gathering and communication to
relevant audiences should be ongoing activities
throughout the entire evaluation process, there is a
priority need for the foreign language profession to be
able to communicate that information to the audiences
that need to use it the most. In today's climate, these
audiences are generally perceived to be the decision
makers, people both within and outside the profession,
at the local, state and national levels, empowered to
make decisions affecting FL programs.

Recent discussions of evaluation utilization have led to
the point of view that evaluation data have many
different kinds of use. 14 For example, such data can be
considered by a local client or by related external
interested parties as an influence in making decisions,
substantiating previous decisions or actions, or
establishing or altering attitudes (Alkin, p. 232). Within
this context, evaluation use can be instrumental,
conceptual or symbolic: "Instrumental use occurs
(where) action is taken in direct response to the results
of an evaluation . . . In contrast, conceptual use refers to
cases where evaluation results influence decision
makers' current thinking about (and potentially their
future action regarding) an issue or program. A third
type of use has little to do with the actual content of
evaluation results. Symbolic use refers to cases where
evaluation results are used indirectly for a variety of
purposes, for example, to garner political support, to
substitute for a decision, or to discredit a disliked
policy" (Thompson).

Regardless of which type of use is being targeted, three
elements in the process have been shown to
influence the degree of potential success. These
elements are:

1) The "personal" factor. The personal factor is
defined as the active presence within an
evaluation study of at least one person who
cares about the process and its results and who
takes personal responsibility for promoting use
of the findings. As Patton discovered in his
studies on evaluation utilization: "Where such
a person or group was present . . . evaluations
were used; where the personal factor was
absent, there was a correspondingly marked
absence of evaluation impact. The personal
factor represents the leadership, interest,
enthusiasm, determination, commitment,
aggressiveness, and caring of specific
individual people." 15

2) The political considerations factor. Within
Patton's studies, (researchers) "found that the
political factors consistently emerged to affect
the utilization process, whether or not the
decision makers and evaluators were fully
aware of the political implications of the study
from the outset" (p. 49). Patton also identified
the political nature of the evaluation process
itself: "the fact that people are involved in
evaluation research; the fact that classification
systems and categories are involved; the fact
that actions and decisions are the desired
result of evaluation; the fact that programs and
organizations are involved; and the fact that
information is involved" (p. 47-8).

3) The nature and format of the evaluation report.
As Alkin's team explains,

The evaluator must call on both his personal
and professional skills in determining the
appropriate substance of evaluation information.
First, s/he must have a clear idea of the
people to whom the information will be
addressed; second, s/he must decide what
information, from all that has been gathered,
will be provided to the selected person(s) and
in what way. If it is well enough tailored to the
users' specific concerns and questions about
the program, we can expect that potential
utility will be greatly enhanced . . . Closely
related to substance, of course, is the
appropriateness of the format in which the
information is presented. What the evaluation
has to offer in the way of data and insights may
be exactly what the program (or other external)
personnel want or need to know, but unless it is
in an intelligible form it may well be ignored or
considered useless. Personal preferences must
be considered; some people are simply
uncomfortable with quantitative data, for
example (p. 233-4).

These research findings convey specific suggestions
and recommended courses of action to professionals
desirous of ensuring that foreign language program
evaluation data will be used. Foreign language
educators need themselves to become conversant with
the purposes and methodologies of evaluation; to look
at their own instructional settings and programs to
determine those evaluation activities most likely to help
them both improve their programs and demonstrate
success to the public; to take the initiative to embody
that "personal" factor so crucial to the effective use of
carefully obtained evaluation data and to local public
relations efforts to counter unsubstantiated criticisms;
to identify local and national decision makers and the
political/social climate in which these persons are
operating so that they may become knowledgeable and
influential enough to participate in the decision-making
process; and to identify the potential audiences for
evaluation data (e.g., parents, news media,
administrators, etc.) so that the content and format of
evaluation reports can be tailored to suit the needs and
preferences of each audience. Finally, and perhaps most
importantly, foreign language educators need to make
evaluation and evaluators their allies in the process to
improve programs and thus ultimately to regain control
over the direction of foreign language programs within
the country.

14 Carol Weiss, "Utilization of Evaluation: Towards Comparative Study," Evaluating Action Programs: Readings in Social Action and Education, ed. C.H. Weiss (Boston: Allyn and Bacon, 1972).
15 Michael Patton, Utilization-Focused Evaluation (Beverly Hills: Sage, 1978), p. 64.

USE OF NATIVE LANGUAGE TESTS
FOR PROGRAM PLANNING PURPOSES

Flora V. Rodriguez-Brown
University of Illinois-Chicago
Illinois Resource Center



A Rationale for Testing Needs in a Bilingual Classroom

Since bilingual education programs supported by
Federal and State funds are to be transitional in nature,
English proficiency is the most relevant factor in
determining participation in bilingual programs. Most of
the time, programs are designed to serve children's
needs according to their different levels of English
proficiency as shown by their performance on different
language proficiency tests.

Tests currently in use which determine bilingual
program participation measure proficiency from many
different perspectives. Some tests measure only one
aspect of language (i.e., vocabulary or syntax). Others
include several variables which the authors define as
being important aspects of language development
derived from their own conceptual framework of
language. While some tests tend to measure
proficiency from a more holistic pragmatic perspective,
others measure it from a discrete point perspective. A
recent report from NIE (1981) describes findings of a
comparison of results of the four major tests currently
used to determine the English language proficiency of
LEP children. In the study, the results of the different
tests showed little correlation in terms of proficiency
levels. It was found that the lack of correlation was due
to the fact that each test selected different aspects of
language to measure in determining proficiency.

Researchers involved in studies in this area (Hayes,
1982; Slaughter and Bennett, 1982; and Rodriguez-Brown
and Elias-Olivares, 1981) have started to discuss
the need to redefine the concept of language proficiency.
Central to the discussion is the idea of looking
at language from a holistic perspective. In terms of
bilingual education, these researchers urge a
redefinition of the concept to clarify whether we are
talking of that proficiency necessary to succeed in
school or in life. The idea of looking at language
proficiency from the perspective of what the child can
do, rather than what adults feel the child should be able
to do at different levels of proficiency, has been posed
as a more realistic approach. Another area of discussion
is the relationship of the concept of language proficiency
to communicative competence and functional
language ability (Slaughter and Bennett, 1982; Hayes,
1982; and Rodriguez-Brown and Elias-Olivares, 1981).
From the view that language proficiency should be
measured as part of communicative competence, then,
tests and other measures should take into account not
only what the child can do in L2 (English) but what s/he
can do in his/her native language (L1) as well. It seems
relevant to know not only the levels of proficiency in L1
and L2 but also how the child uses language (L1 or L2)
in relation to the setting, topic, and formality levels.
While natural language samples seem to give a more
holistic perspective of what the child can do in each
language, their use is restricted due to lack of
standardization. Because of the need to look at
language from a holistic framework, the need to look at
proficiency in terms of L1 and L2 and the need to
examine proficiency from a sociolinguistic perspective,
an assessment of a child's language proficiency in both
the L1 and L2 will be required. In this way, decisions
regarding program planning will be made according to
what children already know. This will facilitate the
learning of children enrolled in bilingual programs.

Research reports by Skutnabb-Kangas and Toukomaa
(1976), Cziko (1976), and Tucker (1975), among others,
have suggested a strong relationship between the
development of L1 and the learning of L2. These
research results add support to the rationale for a need
to assess L1 and L2 development in children attending
bilingual programs.

Skutnabb-Kangas and Toukomaa's study (1976) has
shown that Finnish immigrant children who were
instructed in Swedish (their L2) before they reached the
preoperational stage of development tended to have
problems learning their L2 (Swedish) and achieving at
grade level with other students in the system. In
contrast, Finnish children who began instruction in
Swedish (L2) at an older age and who had established
their first language skills before immigration approached
the achievement levels of majority (Swedish) children.
These data suggest that a certain level of development
in L1 is a prerequisite for learning L2 and achievement in
school. Cummins (1979) stated that before bilingualism
can become an asset in learning, one must reach a
certain threshold level of language development in L1
and L2.



Along with these developments which show the
relationship between L1 and L2 and achievement,
several researchers have shown the need to look
beyond language when planning programs for
"bilingual" 1 children (Bowen, 1977; Troike, 1981;
Tucker, 1977; Rodriguez-Brown, 1979; and Cummins,
1982). These researchers emphasize the need to
examine the child's established sociocultural traits to
determine issues such as participation in the bilingual
program and media of instruction in the program.



1 "Bilingual" children is used in this paper to include children whose home language background is not English and who show low proficiency in
English. As such, the term may include those who only speak ...

In spite of this research evidence which can provide some
direction in bilingual program design and implementation,
most of the decisions currently being made are
based solely on the results of English-language
proficiency testing. Commonly, tests used to measure
English-language proficiency are narrow in scope, do
not assess common areas or accurately indicate a
child's ability to communicate. If we want to plan
programs that better serve the needs of children
attending bilingual programs, it seems necessary to
emphasize the need for the teacher to know what each
child can do linguistically in both L1 and L2, his stage of
cognitive development or readiness for learning and
sociocultural information that might help enhance the
program's impact. If teachers had this type of
information, they would be able to make better
decisions regarding instruction to enhance children's
participation and learning opportunities in the program.
While teacher observations can aid in collecting most
of the data described before, testing is a most useful
tool in gathering program planning information for
specific children.

Some difficulties with formal measures have caused
teachers to acquire negative attitudes toward testing.
They feel they have to administer too many tests and, in
some cases, do not receive the results. Furthermore,
when they do receive the results, they often do not
know how to interpret and/or use them. Teachers find
that time spent in testing reduces the amount of time
available to deliver the curriculum prescribed by the
school district. In the case of bilingual teachers, it is
important that the benefits which testing results may
have for designing and improving instructional
programs become more evident. It is important to
emphasize that whether bilingual programs are
transitional or not, native language testing results offer
a rich source of information which will facilitate
decisions regarding the language used for instruction,
teaching methods, grouping and selection of
curriculum materials. Wise decision making will
enhance achievement and compensate for
environmental situations the teacher may not be able to
control directly.

The use of English language proficiency test results
has been overemphasized in bilingual program design
and planning. It is crucial, following current research
findings (Skutnabb-Kangas and Toukomaa, 1976;
Cummins, 1979), that we try to look beyond English
proficiency. For children who attend bilingual programs,
it is important to look at native language proficiency as
well as readiness skills and previous school
achievement. The following section of this paper will
address the need to administer native language testing
in different areas to enhance the quality of instruction
provided in bilingual programs.

Application of Native Language Test Results in Bilingual
Programs

Presently in the state of Illinois, bilingual programs have
to report native language testing results for children
attending the programs. As such, testing in native
languages will be required to fulfill state bilingual
education guidelines. Teachers, then, should take
advantage of this requirement to collect native
language information to aid in planning classroom
activities specific to the needs of children attending
bilingual programs. In choosing tests, administrators
and teachers should choose those tests which best
correlate with the educational objectives of the district.
Tests should provide users with varied information
regarding both what children do and do not know, as
well as their areas of strength and weakness.

In regard to native language testing, one of the first
areas of concern is to determine with accuracy the
native language of the child. This may not be a problem
if we are dealing with Hispanic children, but it has
proven to be a problem with Indochinese students.
Some of these children have lived in several countries
before they immigrated to the United States, and as a
result, districts tend to assume that the language
spoken in the last country (or area) of residence is the
child's native language. Indochinese students may have
been in contact with several languages and their
knowledge of each will vary. Parents should be involved
in the process of defining the child's primary language
before any native language tests are administered.

Since studies (Tucker, 1975; Cziko, 1976) have shown a
strong relationship between L1 and L2 development in
bilingual children, it seems important, then, to look at
the proficiency of the students in the two languages. It
is necessary not only to compare the proficiency levels
in L1 and L2, but to identify areas of strength and
weakness in the two languages. When looking at language
proficiency in L1 and L2, it is important to look at the
proficiency levels by analyzing the results of more than
one measure. Multiple-part tests which measure
different aspects of language, observational surveys
such as the Student Oral Language Observational
Matrix (SOLOM, California State Department of
Education, 1981) and teacher informal observations
used together can give a more valid and reliable view of
the student's true proficiency, including areas of
strength and weakness. For Hispanic students, a test
such as the Woodcock Proficiency Test (Woodcock,
1981, in English and Spanish) may provide the teacher
with useful diagnostic information for classroom
planning.

Due to the high priority given reading in school
curriculum in the United States, this aspect of
proficiency receives special attention from program
planners. Districts are interested in determining the
children's reading ability in the native language to place
them at an equivalent level in the test measurement
providing program direction. The best use of native
language reading test results is that of finding out what
children can do in their native language. For program
planning, it is important to know not only the highest
grade level attained in L1 but also how relevant the
objectives of the program he attended in L1 were to the
L2 instructional program the school district will be
providing for the child.

We know, more from common sense than from
research findings, that some reading skills transfer from
L1 to L2. This tells us a child who has some background
in reading in L1 does not have to start to learn to read in
English from scratch. It is for this reason that it is
important to find out whether the child knows how to
read in L1 and which skills he brings to school with him.
Native language test results could be a useful tool in
determining a child's readiness to learn. Diagnostic
tests in L1 may help determine the child's strengths and
weaknesses in reading. Besides the usefulness of these
data for decision making in reading instruction, they
could also be a valuable aid for grouping and general
program planning.






Cognitive development and readiness levels
determined via the native language testing results can
affect decisions in the bilingual program. As educators,
many times we judge bilingual children's achievement
problems as related only to their lack of proficiency in
English. Very seldom do we think of whether the
children have the readiness skills or the cognitive
development necessary to perform well in content
areas in school. Tests such as the Boehm Test of Basic
Concepts administered both in English and the native
language of the children could help teachers determine
if children have acquired concepts which are basic for
learning. If the children have those concepts, they are
functional in an English-speaking classroom. Otherwise
they may need to simultaneously acquire both the
concepts and English skills.



Current practices in referral of bilingual children for
psychological testing seem to overlook the fact that
children attending bilingual programs usually have
attended schools in other countries where the language
of instruction as well as the children's learning
experiences varied. The fact that children come to the
bilingual program or the regular school with a certain
degree of development in L1 should not be overlooked
before making any judgment on the children's abilities.

It is important for teachers and school districts to
collect enough observational and/or diagnostic testing
data related to performance in the native language so
that possible areas of concern hindering the student's
learning capacity and/or producing behavioral
disorders might be discovered. Problem areas may be
related directly to cultural and linguistic differences,
thereby not suggesting additional psychological
testing, or they may be related to fundamental learning
problems.

School districts should establish policies in which, as a
general rule, bilingual children are not referred for
psychological testing at least during their first six
months of schooling in the United States. Additionally,
any referral submitted after this period should be
accompanied by a thorough report showing the process
used by the teacher to determine the need for a referral.
This report should include information concerning how
the native language of the child was determined
(especially in the case of Indochinese students),
observations made by teachers and other school
personnel, and previous and/or current school records
as well as testing information collected in the native
language of the child. This testing information could be
used to identify areas of concern which should be
further observed and assessed.

In spite of the lack of valid and reliable psychological
tests constructed in languages other than English,
school districts have been trying to test children with
translations of English language tests. The fact that the
district is conscious that the child does not speak
English is commendable, but this is not enough.
Mistakes have been made commonly, even in determining
the native language of the children. There are cases
where children have been tested in a different foreign
language than their native language with translations of
tests which were not reliable or valid when translated.
Under these conditions, it is important to think of the
number of children who have already been misplaced in
special programs following this dubious process.
Psychological testing is an area in which erroneous
native language testing can be a liability rather than
help for bilingual children. This is why a pre-referral
assessment period that allows for the collection of
observational as well as diagnostic information from
the teacher is so crucial.

Availability of Resources for Native Language Testing



When talking about native language testing, two
questions are consistently raised: first, "what
native-language tests are available?" and second, "who
can administer native-language tests when school
personnel in a district are not available for specific
languages needed?"

An effort to provide resources which show the
availability of native-language tests is a survey done by
the State of Illinois (Rodriguez-Brown and
Starker-Rowe, 1982). The survey tried to determine a)
whether the state offices of bilingual education or
individual districts recommend specific native-language
tests as part of the program design and
planning procedures, b) whether districts have
developed tests in native languages which can be
useful for other programs, and c) the availability from
publishers of native-language tests which can be used
with bilingual students.

The survey showed that few states recommend any
specific native-language tests and furthermore that
most districts surveyed did not require or see a need for
native-language testing. In addition, a majority were
unaware of available tests and/or procedures. A few
states and districts, though, referred us to published or
district-made tests in such areas as language
proficiency, reading, general achievement, math,
readiness skills, etc. The languages of the tests found
were varied. Among them were Arabic, Chinese, Greek,
Hebrew, Italian, Russian and Vietnamese. A survey of
current test bibliographies in wide use coupled with
answers received from the states and districts have
helped us identify as many as 450 tests in different
areas (language proficiency, achievement, etc.) in 39
different languages.

Many of these tests have not been reviewed yet. Many
of them are district made. They are not perfect, but they
seem to serve the need for which the developers
made the test. These tests are not meant to provide a
final assessment and/or diagnostic of students'
problems, but they could be useful tools for teachers
and administrators to find out what the child knows in
the native language when s/he gets to school. The
results of the survey, future updates of currently
available data as well as a review of the different tests
will be a useful source of information on the availability
of native-language tests in a variety of languages and
skill areas.

In regard to the administration of tests in languages for
which bilingual teaching personnel are unavailable,
teacher aides, parents and other community people
proficient in the language (and in certain cases even
peers) can provide valuable assistance in the testing.
Another question which has arisen and requires
attention is what to do when tests in specific languages
do not exist. In this situation, Illinois is committed to
train teachers in the use of alternatives to testing which
can provide useful information especially about
language development. Error (miscue) analysis, native
language sample analysis and the cloze procedure are
alternatives to testing which school personnel can be
trained to use with bilingual students to sample native
language proficiency. In this way, the design of various
school programs can better serve the needs of these
children.
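
As an illustration of how little machinery the cloze procedure needs, the following is a minimal sketch in Python. It assumes the common fixed-ratio form of the procedure, in which every n-th word of a native-language passage is deleted and the student restores the missing words; the paper does not specify a deletion rate or a scoring rule, so the function names, the default of every fifth word, and the exact-word scoring are all illustrative assumptions rather than the procedure used in Illinois.

```python
import string


def make_cloze(text, n=5, blank="_____"):
    """Delete every n-th word of a passage (assumed fixed-ratio cloze).

    Returns the passage with blanks and the list of deleted words.
    Trailing ASCII punctuation stays in the passage, not in the key.
    """
    out, key = [], []
    for i, word in enumerate(text.split(), start=1):
        core = word.rstrip(string.punctuation)
        if i % n == 0 and core:
            key.append(core)
            out.append(blank + word[len(core):])
        else:
            out.append(word)
    return " ".join(out), key


def score_exact(responses, key):
    """Fraction of blanks filled with exactly the deleted word.

    Exact-word scoring is one assumed option; a teacher might instead
    accept any contextually acceptable word.
    """
    if not key:
        return 0.0
    hits = sum(r.strip().lower() == k.lower() for r, k in zip(responses, key))
    return hits / len(key)


passage = "El niño lee un libro en la escuela todos los días."
cloze_text, answer_key = make_cloze(passage, n=5)
print(cloze_text)   # El niño lee un _____ en la escuela todos _____ días.
print(answer_key)   # ['libro', 'los']
print(score_exact(["libro", "las"], answer_key))  # 0.5
```

Because the deletion is mechanical, the same routine can be applied to a passage in any language written with spaces between words, which is the property that makes the procedure attractive where no published test exists.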

Error (miscue) analysis, for example, can be used to
determine reading comprehension skills in the native
language, as well as the stage of development of
reading skills in terms of the student's use of graphic
versus semantic constraints in the text. This gives the
teacher information which may facilitate the design of
a program that better serves the needs of individual
children. The cloze procedure can be used not only to
determine reading comprehension skills, but also
vocabulary and syntax development. Natural language
samples can provide the teacher with a wider
perspective of what children can do linguistically
without limitations to a particular context and setting.
These samples, when analyzed, can provide invaluable
diagnostic information in regard to the receptive and
expressive skills of children. These analyses can be
focused not only on the formal aspects of language but
on the functional level as well. Natural language
samples provide the teacher with a more global view of
children's linguistic skills which, in turn, will help the
teacher in planning activities which are relevant to a
particular child's needs. The school's programs thus
will be more conducive to learning and better serve the
linguistic and academic needs of children in bilingual
programs.
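
The cloze procedure mentioned above can be illustrated with a short sketch. The text does not specify a deletion scheme, so the version below assumes the common fixed-ratio form, in which every n-th word of a passage is replaced by a blank and the student restores it. The names make_cloze and score_exact, the default of every fifth word, and exact-word scoring are illustrative assumptions, not part of the procedures described here.

```python
import re

# A word core followed by optional trailing punctuation. \w is Unicode-aware
# in Python 3, so the pattern also applies to passages in other languages.
_TOKEN = re.compile(r"(\w[\w'-]*)(\W*)$")


def make_cloze(text, n=5, blank="_____"):
    """Replace every n-th word of `text` with a blank (assumed fixed-ratio cloze).

    Returns the cloze passage and the list of deleted words, in order.
    Tokens that do not fit the simple word pattern are left untouched.
    """
    answers = []
    out = []
    for position, token in enumerate(text.split(), start=1):
        match = _TOKEN.match(token)
        if position % n == 0 and match:
            answers.append(match.group(1))
            out.append(blank + match.group(2))  # keep trailing punctuation
        else:
            out.append(token)
    return " ".join(out), answers


def score_exact(answers, responses):
    """Count responses that restore the deleted word exactly (case-insensitive)."""
    return sum(a.lower() == r.strip().lower() for a, r in zip(answers, responses))


passage, key = make_cloze("one two three four five six seven eight nine ten.")
```

Exact-word scoring is only one option; a teacher interested in vocabulary and syntax development might instead accept any response that fits the context, which would require human judgment rather than a string comparison.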

The purpose of this paper is to emphasize the need for
native-language testing in order to plan programs of
instruction that better serve the needs of linguistically
and culturally different children. Until now, programs
have focused principally, if not exclusively, on English
proficiency for program design and planning purposes.

Research evidence exists which shows a strong
relationship between proficiency in L1 and
achievement. Native-language testing, then, can
provide important information to plan programs which
take into account what the child knows and what s/he
is ready to learn when s/he gets to school.
Native-language testing has been carelessly used in
psychological testing of ethnolinguistic minority
children. It is suggested that a moratory period be
established before referral of non-English-speaking
children to special programs is made, whereby teachers
could collect data that will identify the true native
language of the child, as well as collect observational
and diagnostic data in the native language which will
help determine the appropriateness of further referral.
These data may show specific areas of concern which
may require further testing.

Results of a survey of native-language tests presently
available have shown that there are more available
native-language tests than most people believe. These
tests are not perfect, but they serve a need. The results
from these tests are not final, but they can help the
teacher in decision making. Data from these tests can
be collected not only by teachers, but by supportive
staff, peers and community people. When tests are not
available, alternative methods of data collection can be
used. Alternatives discussed are error (miscue) analysis,
native language sample analysis and the cloze
procedure. In general, the paper presents an argument
for the need for native-language testing in order to
provide information to design relevant programs for
children enrolled in bilingual programs.

Language assessment for limited English proficient
(LEP) students enrolled in vocational education
programs requires some unusually interesting
considerations. Unlike in most ESL and bilingual
programs, where LEP students must strive to master the
total linguistic system of English, in a vocational
training situation the language component is limited to
that required for survival in the vocational education
classroom, laboratory, or shop and for success on the
job.



Before the reader can fully appreciate the uniqueness of
the vocational situation, it is important to briefly
illustrate the differences among vocational English
language instruction and other types of ESL programs.
Some of those program types are summarized below.

General ESL. The purpose of this program is to
provide students with all of the lexical, structural,
and phonological components of English. The
sequence of instruction is usually based on
grammatical structures, from the simplest and most
frequent to the most complex. Instructional
techniques can go from rigidly controlled (e.g.,
repetition, substitution, conversion, dialogue
memorization) to less controlled (e.g., questions,
discussions, and role plays).



ESP (English for Special Purposes). The purpose of
this program is to provide students with the
specific English components and skills needed to
function in a given situation. Although job-related
ESL technically fits under this category, ESP
programs in the U.S. have, thus far, exclusively
served students with strong academic skills who
are preparing for professional-level careers.

Prevocational ESL. The purpose of this program is
to provide learners with the English necessary to
survive in a U.S. community. The sequence of
instruction is usually situational and includes such
topics as "calling a doctor," "opening a bank
account," "public transportation," and "reading
want ads."

VESL (Vocational ESL). The purpose of this
program is to provide learners with the English
language components and skills necessary to
survive in a vocational education classroom and a
job. The sequence of instruction is situational and is
related to topics in the given occupation. In an ideal
setting, VESL instruction is provided simultaneously
with bilingual vocational education (BVE) or
bilingual on-the-job training.



Despite the fact that job-related ESL programs are
rapidly increasing, there is a serious lack of appropriate
assessment instruments. For example, according to the
Administrator's Manual for the Bilingual Vocational
Oral Proficiency Test (Melton Peninsula, 1981):

Although instruments have been developed to
determine the level of English proficiency of
children, those tests are not appropriate for an
adult population. Furthermore, most of the
English proficiency tests developed for adults
were designed to test the language required for
college and university work (p. 2).

Because of the similarities between VESL and ESP, and
because of ESP's "head start" in the areas of program
and materials development, one might suggest that
VESL educators model the assessment process after
that employed by ESP educators. However, there is an
equally serious lack of assessment measures in ESP, as
pointed out by Strevens (1977):

It remains a major shortcoming of ESP that very
little work has been done to devise fresh
methods of testing, examining and assessment
that match the new courses of training (p. 129).

The purpose of this chapter is to present some
important considerations in assessing non- and
limited-English-proficient vocational students.
Although discussions are included about VESL
placement testing and vocational skills assessment, the
major focus will be on the process of deriving
occupation-specific content for VESL curricula and
classroom achievement tests. A discussion of
appropriate testing techniques is also provided.

Types of Assessment for BVE and VESL

The assessment process for LEP students enrolled in
vocational education programs is a complex one that
requires a good deal of collaboration among VESL
instructors, vocational instructors, and vocational
counselors. We see this process as containing five
essential components:

1. Initial assessment of English language
proficiency.

2. Initial assessment of vocational skills.

3. Vocational skills achievement (for progress).

4. VESL achievement (for mastery).

5. VESL achievement (for progress).

Initial Assessment of Vocational English-Language
Proficiency

An assessment of English-language proficiency should
take place as soon as the LEP students enroll in the
vocational education program. This assessment serves
many important functions:



1. It can provide a measure for VESL placement
purposes.

2. It can provide a measure to determine whether
the assessment of vocational skills must be in
the native language or whether it can be in
English.

3. It can provide a measure to help determine
growth in English proficiency (achievement)
during the VESL training period.

4. It can provide a measure to determine whether
and how much vocational instruction in the
native language is needed.



Bradley and Friedenberg (1982) presented three
important characteristics of an English proficiency test
for LEP adults. Such a test:

1. does not require reading,
2. is appropriate in content for adults,
3. will elicit the student's ability to communicate
a message as opposed to assessing formal
grammatical correctness (p. 147).

In addition to these characteristics, it is also important
that the test not focus on content with which only those
with strong academic backgrounds could feel
comfortable and that it reflect the structures used most
frequently in vocational education classes. Figure 1
illustrates the linguistic structures which have a
particularly high frequency across vocational education
classes.



Figure 1

Linguistic Structures Which Have a High Frequency
across Vocational Education Classes*

SENTENCE TYPE

N Vtr N                  Noun Transitive Verb Noun
N be N (N Vbe N)         Noun Verb "be" Noun
N be Adj (N Vbe Adj)     Noun Verb "be" Adjective
N be Adv (N Vbe Adv)     Noun Verb "be" Adverb
N Vtr N N                Noun Transitive Verb Noun Noun
N Vint N                 Noun Intransitive Verb Noun
N Vint Adj               Noun Intransitive Verb Adjective

VERB TENSE

Pres                     Present
Past                     Past
Pres Prog                Present Progressive
Modal                    Modal

TRANSFORMATION

Wh                       Who? What? When? Where? Why? Question
Yes-No Ques              Yes-No Question
Comp                     Compound
Imp                      Imperative
Cl-Adj                   Clause-Adjective
Cl-Adv                   Clause-Adverb
Cl-Noun                  Clause-Noun
Pass                     Passive
Part                     Participle
Ger                      Gerund
Inf                      Infinitive
Do                       "Do" Insertion
Neg                      Negative

MODIFIER

Prep Ph-Adv              Prepositional Phrase-Adverb
Prep Ph-Adj              Prepositional Phrase-Adjective
Adj                      Adjective
Adv                      Adverb
Intens                   Intensifier

*Melton Peninsula, 1981, p. 4.

Ideally, the assessment of English language proficiency,
in which an appropriate standardized test is used,
should be a careful, collaborative effort between the
ESL instructor and vocational counselor. One test,
which was developed expressly for the purposes stated
here, is the Bilingual Vocational Oral Proficiency Test
(BVOPT). However, if a test such as the BVOPT is
unavailable or if there is no one able or willing to
conduct a formal measure of English language
proficiency, the vocational instructor should informally
assess the students' English abilities by interviewing
each of them in class or privately. This will, at least,
provide a rough indication of whether the students are
fluent, limited, or non-English-speaking.

Initial Assessment of Vocational Skills

The initial assessment of vocational skills is conducted
by the vocational instructor to determine the point at
which vocational instruction must begin for each
individual student. The assessment activities normally
consist of a series of instructor-developed psychomotor
and cognitive items designed to measure student ability
prior to instruction. Thus, in a typical pretest, the
student might be asked to explain and/or demonstrate a
procedure or process. Language-related problems often
cause the LEP student to score less well on this test
than is warranted by his or her actual vocational skills.
Therefore, this assessment should be conducted
in the LEP student's native language.

Vocational Skills Achievement (for Progress) 

The vocational instructor develops course objectives
based on the tasks derived through occupational
analysis. Learning activities are then developed that
enable the student to attain mastery of each objective.
Finally, appropriate methods of assessment are
designed.



Appropriate assessment requires the student to
perform what the objective asks him or her to perform
under the conditions specified in the objective. For
example, if the objective calls for the student to "give a
shampoo," appropriate evaluation would require the
student to actually give a shampoo. Asking a student to
tell how to give a shampoo or to list the steps in giving
a shampoo would not be appropriate assessment
because the performance during the assessment differs
from the type of performance called for in the objective.

Objectives may call for cognitive, affective, or
psychomotor performances, or any combination of
these behaviors. Therefore, a wide range of assessment
techniques is utilized, with special emphasis on
demonstration of psychomotor performance. In the
case of LEP students, directions and even test items are
often provided in the students' native language. For a
more complete discussion, see Bradley and Friedenberg
(1982).



Assessment of VESL Achievement (for Mastery)

The purpose of an achievement test for mastery is
mainly to assess the degree of success of the
instructional program. Such tests are usually used for
research purposes to evaluate the effectiveness of a
certain type of "experimental" instruction and for
specifically funded programs to determine success and
whether funding ought to be continued. They are also
useful to concerned administrators who want a general
picture of how successful a given program is.

In a VESL program, the instrument used for assessing
overall VESL achievement should be the same
instrument used for the initial assessment of vocational
English-language proficiency. If the measure is to be
used for research or program evaluation purposes, the
testing must be formal (e.g., like the BVOPT); however, if
one only wants a general picture of program
effectiveness, informal procedures, such as interviewing,
may be employed.



Assessment of VESL Achievement (for Progress)

Assessing the achievement (progress) of vocational
English is the most complex, yet the most important,
part of assessing LEP vocational students. It is also the
part which has been least explored.

As in general ESL courses, the content of the
assessment instrument for VESL should be a reflection
of the course. The content of a VESL course is based on
language derived from the vocational content. Thus, a
good deal of close collaboration between VESL and
vocational instructors is essential.

Deriving Curriculum Content 

The goal of vocational education is to prepare
individuals for initial employment, upgrading, retraining
and advancement in business and industry. It is
essential, therefore, that vocational education courses
be based upon the actual job requirements of business
and industry. The most widely used method of
determining vocational education course content is
called occupational analysis or task analysis.

The effective use of task analysis to develop a
vocational education course assures course content
based upon the skills and knowledge currently required
of workers in the occupation being taught. In addition,
task analysis provides the ESL instructor with concrete
data regarding the language that must be taught and,
therefore, assessed. Following is a brief description of
some of the important steps involved in planning
vocational education programs. For a more complete
description of the process, see Bradley and Friedenberg
(1982).




The Job Description 

The first step in analyzing an occupation is to obtain or
develop a written job description. A job description is a
general statement that defines the scope of the course
by describing all of the kinds of duties a specific worker
performs. A job description includes the various duties
involved and lists any special or unusual conditions
under which the duties are carried out (e.g., works
out-of-doors, long periods of standing, heavy lifting,
special clothing, and so forth).

The job description promotes a job-oriented point of
view that guides the selection of only the most relevant
subject matter and learning activities for the course.
The job description also helps to communicate the
purpose of the course to students, community members,
and other educators, including ESL teachers.

Task Listing 

Many occupations require more skills and knowledge
than can possibly be fitted into the time available in a
particular course. The instructor in such a situation
must make certain decisions. Which skills are
absolutely essential for the entry-level job? Which skills
are required a little later on the job? Which skills are
expected only of the experienced worker? The answers
to these questions help the instructor make informed
instructional decisions.

Duties and Tasks. When analyzing complex occupations
such as automobile mechanic, cosmetologist, and
nurse, it becomes immediately apparent that workers in
each of these occupations perform numerous tasks and
that certain tasks are logically related to each other. It
makes sense for the course developer, the instructor, to
place these related tasks into logical groupings. One
method of placing tasks in logical groups is to begin by
first identifying the major duties of that occupation,
then listing each task under the related duty.

A duty is one of the distinct major activities or units
involved in the work performed in an occupation. For
example, analyzing the automobile mechanic's
occupation would reveal that certain activities can be
categorized as diagnosing, others as repairing,
replacing, servicing and so forth. Each of these broad
categories is known as a duty. Each of these duties is
composed of a set of related tasks. Figure 2 presents the
duties of a cosmetologist.

After the duties have been listed, each of the tasks is
identified and placed under the appropriate duty. A task
is a unit of work consisting of a set of related behaviors
(steps) that make up a particular portion of a job. For
example, one of the tasks of an auto body worker is to
replace fiberglass panels. All of the steps involved in
replacing fiberglass panels serve to make up the
complete task. Part of a waiter's job is to take the
customer's order. Taking a customer's order, then, is
one of the tasks that make up the waiter's job. Typing a
business letter is one of the tasks of the clerk-typist's
job. Figure 3 presents an example of the tasks that
would be listed under one of the duties of a
cosmetologist.



Figure 2
Duties of a Cosmetologist

Mixing Supplies and Sanitizing Equipment
Shampooing and Rinsing Hair
Cutting and Shaping Hair
Setting and Combing Hair into Style
Cleaning and Styling Wigs and Hairpieces
Treating Special Scalp Conditions
Curling and Relaxing Hair
Bleaching and Tinting Hair
Manicuring and Pedicuring Nails
Giving Facial Treatments
Keeping Records



Figure 3 

Example of a Duty and Related Tasks 

Occupation: Cosmetologist 

Duty: Shampooing and Rinsing Hair 

Tasks:

1. Give a Plain Shampoo.

2. Shampoo Lightened Hair. 

3. Give a Mild Acid Rinse. 



Some jobs, such as key punch operator, retail store
cashier, and window washer, are ordinarily referred to as
single-duty occupations, and a task listing is sufficient.
However, when analyzing a multi-duty occupation, the
process is normally begun by first identifying the major
duties of that occupation and then the tasks related to
each duty.

The instructor develops a task listing by reviewing the
formal written job description. Then, since actual work
experience in the occupation to be taught is required of
each vocational instructor, the instructor reviews his or
her own occupational experience and the literature to
develop an initial task listing. However, for accuracy
and up-to-date information, the instructor finds it
essential to observe and talk with workers who are
currently performing the job being analyzed. Thus, the
task listing provides a list of all of the tasks workers are
currently required to perform in the occupation being
analyzed.

Task Detailing

Detailing is the process of listing each step required to
perform a particular task. Drawing upon occupation
literature, personal experience, and observation of and
discussions with experienced workers actually
performing the task, the instructor writes in sequential
order every step required to perform the task. For
example, detailing the task of baking a sheet cake
would produce the following steps: read the recipe,
gather the necessary ingredients and equipment, wash
hands, cream the sugar, shortening and salt, and so
forth. The purpose of detailing is to be certain that every
aspect of each task is included in the instruction. Figure
4 presents the steps required for a cosmetologist to give
a patron a plain shampoo.

Figure 4

Example of Task Detailing

Occupation: Cosmetologist
Duty: Shampooing and Rinsing Hair
Task: Give a Plain Shampoo

Steps:

1. Select and arrange all materials.
2. Sanitize hands.
3. Greet patron.
4. Seat patron comfortably.
5. Ask patron to remove neck jewelry or ear
jewelry and glasses.
6. Drape patron.
7. Remove any hair pins from hair.
8. Examine condition of patron's hair and scalp.
9. Brush hair thoroughly.
10. Adjust shampoo cape over back of shampoo
chair.
11. Adjust volume and temperature of water
spray.
12. Wet hair thoroughly.
13. Apply shampoo.
14. Give manipulations.
15. Rinse hair.
16. Apply shampoo again.
17. Give manipulations.
18. Rinse hair thoroughly.
19. Towel-dry hair.





When detailing a task, experienced instructors also
often identify the major type of performance involved in
each step. Many instructors simply classify the behavior
as cognitive or psychomotor. However, a growing
number use the behavior categories provided by Mager
and Beach (1967): Recall, Discrimination, Problem
Solving, Manipulation and Speech. Identifying the type
of performance enables the instructor to ascertain that
students will practice the kinds of performances
required in the task. For example, if speaking to the
customer is a step in the task, the instructor notes
speech as the performance and makes certain that,
when learning activities are selected, students have the
opportunity to practice the exact performance required
on the job: speaking to the customer.

The task listing also often includes who performs the
task (e.g., entry-level worker or experienced worker) and
how often it is performed. Instructors also note the tools
and materials being used while the tasks are being
performed. Therefore, task detailing also provides a list
of tools, materials, equipment and supplies that must be
obtained for the instructional program. Figure 5
presents a list of the tools, materials, equipment and
supplies obtained while detailing the task of giving a
plain shampoo. However, the primary reason that the
vocational instructor details each task is to assure that
every step of every task is included in the instructional
plan.





Figure 5

Tools, Materials, Equipment and Supplies List

Occupation: Cosmetologist
Duty: Shampooing and Rinsing Hair
Task: Give a Plain Shampoo

Sterilized Towel
Sanitized Comb
Sanitized Brush
Neck Strip
Shampoo Cape
Shampoo (According to patron's needs)
Hair Rinse or Conditioner



Task Analysis for LEP Students

A limited, but growing, number of vocational instructors
are professionally prepared in bilingual vocational
education methodology. These trained instructors
recognize the importance of the language associated
with the safe and successful completion of every task.
Therefore, during task detailing, these instructors
identify the special language associated with each task.
In addition to listing the names of tools, equipment and
supplies, these instructors note the technical terms and
phrases related to each task. They also record the types
of questions and directions workers respond to when
performing each task. These instructors recognize that
this "language of the trade" is an important part of the
preparation of all vocational education students, but
that for LEP students, acquiring the language of the
trade is a survival skill.

The specially trained vocational instructor of LEP
students uses the language of the trade in at least two
ways. First, the instructor includes these terms, phrases,
questions and directions in his or her instructional plan.
Second, the list of essential terms, phrases, questions
and directions is shared and discussed with the ESL
teacher. A cooperative plan is developed whereby the
ESL instructor teaches the needed language at
appropriate times and the vocational instructor
provides opportunities for practice and reinforcement of
the use of the needed language as it is being used. Such
collaboration between the ESL teacher and the
vocational instructor is essential to assure that ESL
instruction is, in fact, job-related and presented in
appropriate sequence.

Unfortunately, only a limited number of both ESL
teachers and vocational instructors are trained in the
techniques of collaboration. However, it is not difficult
for the ESL instructor to determine what and when
language must be taught even when the vocational
instructor is not trained in bilingual vocational
education.



Deriving Content and Sequence for ESL

When the vocational instructor is trained in bilingual
vocational education methodology, the essential
language of the trade is determined through task
detailing and shared during the collaborative process.
When the vocational instructor is not trained in
bilingual vocational education methodology, the ESL
teacher determines ESL content and sequence through
discussions with the vocational instructor.

The ESL instructor is aware that the vocational
instructor has a list of the tools, equipment, materials
and supplies required for each task. Thus, a lexicon is
readily available. Task detailing sheets provide a list of
the imperatives that must be learned and the order in
which they must be learned. The ESL instructor begins
the meeting with the vocational instructor by
requesting a copy of the appropriate task detailing
sheet and a list of related tools, equipment, materials
and supplies (inventory). A few questions provide the
balance of the needed information.
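
One way to picture how the task detailing sheet and the inventory feed the ESL content is the following minimal sketch. It uses a few steps from Figure 4 and items from Figure 5. The class and function names (Step, Task, language_inventory) are hypothetical, and the performance labels attached to individual steps are illustrative assumptions rather than classifications given in the figures.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Step:
    text: str                          # imperative, as written on the detailing sheet
    performance: str = "manipulation"  # performance type noted by the instructor (assumed)


@dataclass
class Task:
    occupation: str
    duty: str
    name: str
    steps: List[Step] = field(default_factory=list)
    inventory: List[str] = field(default_factory=list)  # tools, materials, equipment, supplies


def language_inventory(task):
    """Collect the language samples an ESL instructor could draw from one task."""
    return {
        "lexicon": sorted(task.inventory),
        # Step order is kept, since it suggests the order in which imperatives are taught.
        "imperatives": [s.text for s in task.steps],
        "speech_steps": [s.text for s in task.steps if s.performance == "speech"],
    }


shampoo = Task(
    occupation="Cosmetologist",
    duty="Shampooing and Rinsing Hair",
    name="Give a Plain Shampoo",
    steps=[
        Step("Select and arrange all materials."),
        Step("Sanitize hands."),
        Step("Greet patron.", performance="speech"),
        Step("Seat patron comfortably."),
        Step("Ask patron to remove neck jewelry or ear jewelry and glasses.",
             performance="speech"),
    ],
    inventory=["Sterilized Towel", "Sanitized Comb", "Sanitized Brush",
               "Neck Strip", "Shampoo Cape"],
)

inv = language_inventory(shampoo)
```

Keeping the steps flagged as speech in a separate list points the ESL instructor to the places where the student must actually say something on the job, which the follow-up questions to the vocational instructor can then fill out.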

The ESL instructor asks the vocational instructor
questions such as "What oral questions or directions
will be directed to the student while performing the
task?" "What will the student be expected to say (to
customers, peers, supervisor) while performing this
task?" "What must the student read while performing
this task?" The answers to questions such as these,
added to the lexicon, task detailing sheet, course
hand-outs and text, provide the ESL teacher with a
comprehensive knowledge of the language that must
be taught. Sequence is then discussed and agreed upon.

The ESL instructor then organizes the language
samples collected from the vocational teacher. These
samples will contain safety expressions, technical
terms, sub-technical terms, classroom-related
expressions, non-technical words, and various
grammatical structures.

It must be remembered that the vocational instructor,
whether or not trained in bilingual methodology, is not
a language teacher. The vocational instructor must
receive guidance and suggestions from the ESL teacher
regarding how and when to reinforce the language
learning. Thus, frequent collaboration is essential.

Assessment Techniques

Just as the content of the VESL achievement test
should reflect the content of the VESL course, the
testing techniques should also reflect the learning
activities employed during the course. These learning
activities should reflect authentic on-the-job or
vocational classroom activities as much as possible.
Hence, VESL learning activities should be communicative;
that is, they should
require students to integrate language components for
comprehension and production, as opposed to
manipulative, where students focus on discrete
elements.



Up until recently, few ESL techniques had been
developed which gave practice in integrating language
components meaningfully. Now several language
educators have developed exciting and innovative learning
activities that promote communication. These activities
include discussions, information getting, reporting, and
description (Savignon, 1972); group dialogue (Farid,
1976); microcounseling (Friedenberg and Bradley,
1981); and others.

If few communicative teaching techniques exist, even
fewer integrative testing techniques exist. The
challenge is even greater for VESL students, many of
whom possess no literacy skills. The following sampling
of activities may prove successful for VESL
achievement testing which requires neither reading nor
writing.

1. Oral Presentation: students explain processes
or procedures.

2. Pictorial Multiple Choice: students choose the
correct drawing of equipment or process based
on what they hear (taped or live).

3. Acting Out Situations: students role play
real-life situations.

4. Paraphrasing: students reword an utterance
they hear.

5. Physical Response: students carry out
commands.




Conclusion

The assessment of LEP vocational students
comprises five essential components: 1) Initial
assessment of English language proficiency, 2) Initial
assessment of vocational skills, 3) Vocational skills
achievement, 4) English language achievement for
mastery, and 5) English language achievement for
progress. This assessment process is a uniquely
challenging one which requires collaboration among
the vocational instructor, the vocational counselor, and
the ESL instructor.

This chapter focused mainly on the considerations
necessary to develop the content for vocational ESL
achievement tests (for progress). As in general ESL
courses, the content and techniques of the assessment
instrument should reflect the content and learning
activities experienced in the course. Unlike general ESL
courses, however, the content of a VESL course
contains only the language necessary to survive in the
vocational education classroom, laboratory, or shop and
for success on the job. This content is derived from the
task analysis.

Although slow in coming, bilingual vocational
education and vocational ESL programs have become
important developments in our field. They help
language-minority youth and adults to enter the job
market adequately and gain self-satisfaction, and
they contribute to the economic growth of our nation. In
order to increase the effectiveness of these programs,
we must strive to develop well-trained staff, suitable
instructional materials, and assessment instruments
that evaluate LEP students reliably and fairly.

TEACHING AND TESTING BOTH COMMUNICATIVE AND ACADEMIC SKILLS
IN ENGLISH AS A SECOND LANGUAGE



Muriel Saville-Troike 
University of Illinois at Urbana-Champaign 



The focus in second language research, testing, and
teaching has shifted during the past decade from the
attainment of grammatical proficiency to the acquisition
or learning of "communicative competence." This is
generally defined as including all of the knowledge
about grammar and vocabulary considered in earlier
theory and practice, plus knowledge about appropriate
use of the language in naturally occurring social
contexts, ability to use the language for desired
purposes (for example, to request, command, or inform),
and skill in such interactional tasks as initiating and
maintaining a conversation.



In spite of this change of focus, however, there have
been, to date, very few studies which have examined
such things as (1) the development of communicative
forms in a naturalistic setting, (2) the nature and relative
effectiveness of different communicative tactics that
limited English-speaking children use while they still
have very limited linguistic means at their disposal, or
(3) the relationship of growth in communicative
competence to academic achievement when that must
be acquired and evaluated primarily through the
medium of English.

For years, we proceeded under the assumption that
learning to use a language was a unitary phenomenon
and that whatever proficiency was developed would
serve both communicative and academic needs. James
Cummins (e.g. 1980) and others have made us aware in
recent years, however, that some students may become
quite effective in interpersonal communication in a
language, and yet apparently not possess the linguistic
skills required to succeed academically through the
medium of that same language. This is true in
first-language acquisition as well as second-language
learning, as we observe many children (and older
students) who communicate quite effectively in the
highly contextualized situations of face-to-face
interaction, and yet fail in the context-reduced linguistic
tasks required in reading, writing, and more advanced
cognitive processing.

Now the general assumption in both our teaching and
testing is that interpersonal communicative skills are
lower on a developmental continuum than the skills for
academic linguistic competence. We assume, for
instance, that the communicative training we now
include in ESL lessons will contribute significantly to
our students' success in learning through the medium
of English, but we remain generally unaware of what
the linguistic demands on them actually are when they
are in science classes, mathematics classes, and other
content areas. And our evaluative scales, including
FSI-type ratings, assume that ability to function
effectively in academic contexts in a language
presupposes ability to function effectively in
interpersonal contexts as well. But this is not
necessarily the case.

Because empirical evidence to support our assumptions
about such issues is still quite limited, I wish to present
some results from research I have been conducting with
Erica McClure and Mary Fritz (cf. Saville-Troike et al.,
1982 and McClure et al., 1982). Then I would like to
draw some tentative conclusions from our findings to
date, and discuss their possible relevance to research
methods, testing procedures, and classroom practices.

Our research thus far has addressed the following more
specific questions:

1. What do children who are new to an
English-speaking milieu need to accomplish in
various communicative contexts during their
first weeks and months of school?

2. What tactics do they use to fulfill these
functions, and how do these develop over
time? (By "communicative tactics" I mean the
linguistic or nonlinguistic means at their
disposal that children select to try to
accomplish their communicative goals.)

3. What is the relative effectiveness of different
tactics for accomplishing the children's intent?

4. And finally, how do communicative tactics and
grammatical development relate to how well
the children succeed in learning through the
medium of English?

I would like to begin by briefly describing our 
population and procedures. 

The population for our research consisted of twenty
speakers of Japanese, Korean, Hebrew, Spanish,
Icelandic, and Polish who were enrolled in a
multilingual program during 1981-82, of whom 18
completed the school year. In the elementary school
where we carried out this research, almost half of the
students are native speakers of a language other than
English, and all native English-speaking students are
required to study a second language.

The children in our sample ranged in age from 7 to 12
years (grades 2 through 6). At the time of their selection
at the beginning of the fall semester, all of the subjects
(a) knew very little or no English, (b) were already
literate in their native languages, and (c) had
well-educated parents (professors or graduate students
at the University of Illinois).

1 The research cited in this report was supported in part by the Center for the Study of Reading, under a grant from the National Institute of

The twenty children were divided into two groups for
daily 30-minute pull-out ESL instructional periods,
according to their age and maturity level. I videotaped
these sessions once a week for 30 weeks during the
1981-82 school year, and one of my colleagues
observed and audiotaped the first ten minutes of each
session another two times a week. As part of the
research design (but in a situation also occurring
naturally in previous years), the children were
videotaped while without adult supervision or
intervention before the ESL teacher came into the room,
as well as during the ESL lesson. Additionally, we
observed weekly in regular classrooms and in other
school contexts, videotaped once a month in regular
classrooms, and audiotaped and videotaped irregularly
in other situations (e.g. on the playground, in the library
and halls, and in native-language reading sessions).

We began our determination of the communicative
tasks these students faced with a microanalysis of the
ESL instructional periods. Videotapes were scripted,
and then all communicative acts in sixteen of the
sessions were coded for inferred speaker intent, using a
system adapted from Dore (1978). This system includes
such categories of communicative acts as requests,
responses to requests, descriptions of facts, statements
of rules and beliefs, acknowledgments which recognize
and evaluate responses, organizational devices that
regulate contact and conversation, and performatives.
Our most significant modification was in extending the
system to include nonverbal as well as verbal
communicative acts; additionally, we have added
request and response types which may be unique to
language teaching situations (e.g. "Use a complete
sentence."), and new categories for translation and
interaction between children (e.g. invitation,
acceptance and rejection of interaction, display
behavior, and phatic communion with no propositional
content).

Our first relevant finding is that children need or want to
communicate for very different reasons in different 
types of social or instructional events, even within one 
time slot that is considered a single lesson. 

The ESL lessons, for instance, divided themselves into a
regular sequence of events that required a variety of
communicative skills. The first was "unstructured" or
"child-directed," as the children interacted with one
another before the teacher arrived and called them to
the instructional area. When she did, there followed an
event we called "claiming a seat," during which the
children challenged, claimed and negotiated for their
favorite position at a single large table.



During these two events, almost all communication that
took place was between children, and generally
between children that did not share a common native
language. Their primary reasons for communicating
were either Interactional or Performative, and they had
little need to request or give information of any kind.
The most common communicative goals were
engaging another child in play, teasing, protesting,
calling attention to things, and claiming objects or
territory. To achieve these goals, children do not really
need language, as these children demonstrated. The
form of most of these acts was entirely nonverbal, i.e.
gestures and physical contact, accompanied by
non-speech sounds. It is interesting to note that there
was almost no increase in the amount of verbalization
used in these events throughout the year, even as the
children developed proficiency in English. One
significant change in the verbalizations that did occur
was from use of the native language to protest and
claim (even with children who didn't understand it) to
such English routines as "Don't do" and "That's mine."
Observations and recordings on the playground yielded
the same finding: Children can and do participate in a
great deal of social activity, even when fairly
complicated rules are involved, with little or no
language. Other children taught non-English speakers
games by demonstrating what to do and by correcting
their mistakes with a simple "No." or with physical
intervention.

The earliest tactics that our subjects used to converse
with one another can be illustrated with an episode that
occurred in the third week of videotaping. This
sequence occurred as three boys - speakers of
Japanese (J), Korean (K), and Spanish (S) - worked
together to assemble a puzzle map of the world.
Conversational rounds consisted mainly of one child
naming a referent and the other children taking turns
repeating the term in round robin fashion.

S: "America."

J (He found a puzzle piece of the United 
States and put it in place.) 

J picked up S's hand and moved it out of the way. 

J: "Japani, Japani."

(He pointed to the place in the puzzle frame
where the Japan piece would go.)

"Japani."
"Japani."
"Japani."
"Japani."

"Ah. Japani."
(He found the Japan piece to the puzzle.)

"Korea." 

(He also claimed to have found Korea.) 
Adding only "No," and using different stress and
intonational contours, the children disagreed and
questioned one another.



"No Korea."
"Korea." 
"No." 
"Korea." 
"Korea?" 
"No." 



S: (Spanish) "Si?" 


The Korean boy offered an explanation, which the
others acknowledged by repeating it after him.

K: "Yes, yes, yes.
No Korea. Pink Korea." (Stress on "pink.")

"Pink" referred to the color of the Korean map piece, i.e.,
"It can't be Korea, because Korea is pink."

"Pink."
"Pink."

"Korea."
(He found the piece and put it in place.)



The ESL teacher entered the scene, telling the class to
put away their games. The Korean boy unsuccessfully
tried to protest, saying to her in Korean (which the
teacher doesn't understand):

"Ah! I have to finish."

This sequence concluded with more repetition and
nonverbal tactics as J and K leaped up and ran toward
the table, and the teacher gently stopped K with her
arm around his neck and shoulder.

K said "He," as he pointed toward the table, protesting 
that J got away with running to claim a chair. He then 
used the most complex English structure of this whole 
sequence when he yelled to J, "Hey, walking."

J responded in Japanese, "Damare!" which means
"Shut up."



It is apparent that from the beginning these children 
have conversational skills for attention getting and turn 
taking, and even with minimal language, relate their 
moves in a coherent manner to the form and content of 
the one that precedes it. In Grice's (1975) terms, they
know the cooperative principle of conversation, and are 
"relevant." 

The earliest communicative forms available to these 
children (as for children acquiring a first language) are 
nonvocal behaviors, sound play and nonspeech sounds, 
simple routines, repetition of part or all of previous 
utterances, and single referential terms they are
beginning to learn in English (cf. Keenan, 1974; Peck,
1978). Additionally, these second language learners
have access to first language forms and tactics and
have had experience interacting with other children.



This pattern held true for all of the children in this study
who did interact socially with other children, but some
of our subjects did not. I will return to this point to
discuss the relationship of peer interaction to language 
development. 

A second type of event during the ESL sessions was
highly structured and teacher-directed, including such
opening routines as "What day is it today?" and the
focal instructional activity (usually introduction of new
vocabulary or new grammatical structures). During
these events, almost all communication that took place
was between child and teacher. The aim of most of the
children's communicative acts was to respond to a
question or request from the teacher. The only other
acts that occurred with any frequency were children's
bids for the teacher's attention. The teachers' requests
changed in form through the year from requests to
repeat after them and identify single lexical items to
yes/no questions and WH-questions. The children's
responses matched these changes, going from primarily
repetition, to nodding yes or no, to single words, to
(primarily memorized) phrase and sentence patterns.
Observation and recording in the children's regular
classrooms indicated that this same participant
structure occurred in small group instruction where all
students in the group were at a similar level in their
language proficiency, especially for reading, or when
the regular classroom teacher worked with students
individually, but this seldom accounted for more than
another 20 minutes out of their 5 1/2 hour school day.

Occasionally we observed instances where the
children's facility in responding to such questions
(combined with the skill in learning procedures
nonverbally from other children that I have already
mentioned) misled an adult in the regular classroom
into thinking children understood much more English
than they actually did. Early in the year, for instance,
children learned the names for shapes in ESL. Back in
his classroom, one very limited-English speaker was
building an elaborate pyramid for an art lesson by
gluing toothpicks together when he was asked, "What
geometrical shape is that?" He certainly understood
only the word "shape," but appropriately responded,
"Triangle." When the adult working with him was asked
to evaluate his English at the end of the art lesson, she
reported that he was quite fluent.

The third type of event during the ESL session was
semi-directed by the teacher, but included children
addressing each other as well as her. These events were
characterized primarily by the need for a real exchange
of information and constituted the largest time block in
the classes we observed. This included the teacher
asking the whereabouts of absent children before the
structured lesson began and less structured follow-up
activities, usually including explanation and
implementation of an art project or game. It was only in
this type of event throughout the year that the functions
of the children's communicative production regularly
included the categories of Requests, Descriptions, and
Statements, and it was only in this type of event that
they went beyond memorized patterns as they
struggled to obtain needed information, or to express
themselves to the teacher and other children. The
results were often ungrammatical, but it is interesting
to note that grammar was never corrected when real
communication was at stake. Successful communication,
in fact, often included a combination of English,
nonverbal miming, and the speaker's native language. A
preliminary analysis of the correlation of the language
forms presented in the structured lessons and the
language forms actually used by the children for less
structured communicative purposes shows little
carryover except for vocabulary. 2 The "is" of "This is a
pencil," for instance, was generally not used for weeks
or months in other contexts, and (along with articles
and past tense or plural inflections) is still absent for
some who have by now achieved considerable fluency
in both communicative and academic uses of English.

There was of course intragroup variability. A few of the 
children sought interaction only with the teacher during 
these events, and a few almost entirely with other 
children. When another child in the group spoke the 
same native language, some fulfilled these goals 
primarily through that language rather than English, 
especially during the first half of the year. Again, I will 
return to this point when I discuss the relationship of 
peer interaction to English language development, and 
to academic achievement through the medium of 
English.



Observation and recording during the rest of the
children's five and one-half-hour school day showed 
much more carryover of these communicative needs 
and skills to other learning situations, with some 
interesting similarities and differences in tactics and 
results. Children who preferred to request information 
and express themselves in their native language 
continued to do so, either seeking out a bilingual adult, 
or making use of bilingual classmates to explain 
directions that had been given in English. Not
infrequently, these limited-English speakers joined 
forces to figure out problems they could not 
understand. Translations of the non-English portions of 
our recordings indicate a very high percentage of native 
language communicative acts was directly related to 
the assigned academic tasks. Some children used their 
native language in this context even when there were 
no bilingual adults or children in the class who shared 
their language, verbalizing extensively to themselves as 
they worked. Standardized achievement tests
administered to all students in the school in English at the end
of the year (the CTBS) show us in retrospect that this 
language preference group includes three of the five 
children who scored highest in English reading and 
science. 

In contrast to the similarities in early conversational
tactics, children used basically different means for
describing events and actions during the early stages of
learning English. One tactic was to use a single lexical
term (usually the name of the topic the child wanted to
convey information about). Completion of the
proposition required a cooperative addressee, who
asked one or more questions that allowed the child to
respond with a simple "yes" or "no." This tactic was
used more frequently with adults than with other
children.

The following sequence, which exemplifies this tactic,
was a Japanese speaker's (J) response to the teacher's (T)
question about why another child had not come to class.

J: "Picture."
T: "She's making a picture?"
J: "Yes."
T: "Will she come when she's finished?"
J: "Yes."



The same cooperative process continued for the
construction of longer texts even after children could
express a whole proposition themselves by stringing
together multiple terms in English, with adults first
providing expanded models, and later prompting
expansions by asking WH-questions. This rarely
occurred in child-child communication, and never
between limited-English speakers.

The second tactic was to create a complete
topic-comment proposition by using both verbal and
nonverbal means, usually naming the topic in English
and then providing a nonverbal comment.

In the following example, a Korean boy (K) named both
agent and object as he communicated to the teacher (T)
that Taki had thrown a pencil at him while passing out
supplies.

K: "Teacher. Teacher."

(The teacher went over to him.)

K: "Taki. Pencil."

(He made a throwing motion.)
"Taki. Un ... he."

(He made a throwing motion again.)

T: "Taki, please don't throw pencils."

Most children used this tactic more frequently with
other children than with adults, but a few used it almost
exclusively for any situation. The general pattern that
developed for longer narratives by children who
preferred this tactic was: (1) getting attention (e.g.
"Look, look"), (2) indicating focus (e.g. "this," "right here,"
tapping the object that was in focus), (3) naming
objects, and (4) acting out the event.

Students who developed considerable early
competence in communicating with a combination of
English and miming to other limited-English-speaking
children and the teachers in ESL did not have the same
success with teachers in their regular classroom. This
tactic, however, proved to be very successful for
communicating with English-speaking children, and
was used by all of the children who from the beginning
of the year interacted most with English-speaking peers
at school. But this group included none who
subsequently scored highest on the CTBS.

This was a surprising finding since our original
assumption was that children who interacted the most
socially with other English-speaking children would
learn English faster, and thus do better in
English-medium instruction; this turned out not to be
the case. In fact, along with the three strong
native-language-preference students who scored
among the top five, the other two top achievers rarely
spoke at all to the other children during the ESL
sessions that were audio- and videotaped. Even in their
regular classrooms and on the playground, they
appeared to make minimal use of their developing
English for social purposes.

On the other hand, some of the other children who did
not engage in much verbal interaction with their peers
proved to be among the poorest in their language
development and academic achievement in English, so I
am not suggesting any simple correlation and certainly
no cause-effect relationship. A striking observation,
nevertheless, is that some of the more successful
communicators, at least in the contexts we recorded,
fossilized at fairly early levels of development,
suggesting that their very success may have reduced
their motivation to learn more complex linguistic forms.
This fossilization was particularly evident among those
who continued to combine verbal and nonverbal
actions to express propositions.

In addition to the types of communicative events I have
just described, two other kinds of instructional events
were frequently observed and recorded in regular
classrooms, but never in ESL (in part because of the
smaller class size), and it was in these that we saw most
of the limited-English-speaking children entirely unable
to cope: staring out windows, doodling, poking their
neighbors - one even crawling under his desk in
retreat.

These are the teacher-to-whole-class participant
structure, where the group was being talked to (or read
to) as a whole, and its opposite, the fully independent
work activity period, where students were given written
or oral instructions in English and expected to proceed
without additional interaction with adults or peers. The
amount of time spent in these types of events varied
greatly from class to class, but they occurred to some extent
in all. In the classrooms we observed through the year,
these kinds of events constituted from ten to over fifty
percent of the instructional day. The percentage
generally increased with grade level, and I feel sure we
would find students engaged in - or disengaged from
- even more of these kinds of instructional situations if
our research had continued into junior high and high
school.

In spite of the very similar nature of our subjects'
socioeconomic status, family educational level, prior
literacy in their native languages, and even type of
residence and play facilities (they all lived in the same
university-owned housing complex), by the end of the
year they differed greatly in the degree to which they
had mastered English and in their level of
accomplishment in reading and other subjects taught
and tested through the medium of English. As I
summarize the final results of our study, important
questions to keep in mind are: What did ESL contribute
to the achievement of the successful students? And
what more can it do?

First, most children do not have to be taught to
communicate with one another; while social interaction
between them is certainly to be encouraged, we cannot
depend on that alone for developing English language
skills.

Second, there is a qualitative difference between the 
communicative tactics and skills children find effective 
for meeting their social needs and goals and those that 
are necessary for successful academic achievement in 
the classroom. As teachers of students who must learn 
to learn through the medium of English, we should be
concerned that academic achievement — not just the 
learning of English — has a clear priority in our curricula. 

Third, the portions of our lessons which focus on
structural patterns, especially on English morphology,
appear to have least applicability to students'
immediate academic needs. Not only do most
beginning students not use the grammatical inflections
when attention is on receiving or imparting information,
they really don't need to. One example is provided by a
third grade student in our sample, a Japanese girl, who
scored at the 65th percentile of national norms in
reading on the CTBS after only one year of English, and
above the 90th percentile in all content areas in the test
battery. Yet she still systematically omitted plurals,
articles, and tense markers in her spoken English, as did
four out of five others among the top six academic
achievers. On the other hand, the Polish, Icelandic, and
Spanish speakers ranking 13th, 14th, and 17th out of 18
in achievement scores had a far better mastery of
English morphology because of positive transfer of
inflections from their native languages.

This is also quite relevant for testing and placement,
since we found no significant correlation between
students' scores on the Bilingual Syntax Measure
(which emphasizes morphology) and their ability to
achieve in English-medium instructional contexts. The
only language factor that does appear to be critical is
vocabulary, which usually receives woefully inadequate
attention in ESL teaching and testing. The
determination of which vocabulary is critical for
specific students requires closer coordination with
regular classroom teachers than most specialized ESL
teachers maintain and a better knowledge on their part
of the content areas of the curriculum.

Fourth, as I reported earlier, the lowest academic
achievers in our sample were among the most
successful at interpersonal communication, especially
with other children. Academic success requires
competence in using and understanding language in
context-reduced situations in which students cannot
rely on nonverbal elements of communication. The
language skill which is most likely to develop this
competence is writing, yet it is sadly last not only in the
listening-speaking-reading-writing sequence we recite,
but also in the time and attention allotted it in most ESL
classes.

The few students in our sample who could cope with
independent instructional activities in regular
classrooms possessed skills that were not taught in
ESL but could be. One important skill they shared, for
instance, was their ability to make good use of a
bilingual dictionary. One mother had taught her child
this skill by encouraging her to write sentences in her
native language and then look the words up and
translate the sentences into English. This yielded
ungrammatical sentences, of course, especially since
the mother did not speak enough English herself to
provide correction (Koda, 1982). Still, the skill proved to
be very useful in school and might well be added to
elementary-level ESL objectives for the benefit of
children whose parents do not provide this instruction.

Finally, I would like to emphasize again that most of the
children who achieved best in content areas, as
measured by tests in English, were those who had the
opportunity to discuss the concepts they were learning
in their native language with other children or adults.
Even in linguistically heterogeneous situations such
as those in our research, at least some degree of
bilingual education is proving to be feasible and clearly
provides the best context for conceptual development
and for learning English.

I began by saying that we in TESL have shifted our
focus from grammatical competence to communicative
competence, and I consider this a positive
development. But I must close by saying that this still
falls short of our responsibility to students who must
learn to learn through the medium of English - which
includes most of our students. We need to develop their
academic competence as well, and this calls for even
more changes in our priorities and in our procedures.
Developing communicative competence alone is a
desirable but insufficient goal for English teaching, and
- ironically - may even interfere with academic
achievement. We must begin to place more emphasis
on vocabulary learning and less on grammar. (You will
note I have not even mentioned pronunciation.) We
must recognize the value and importance of the native
language as a basis on which to build the learning of
English. And above all, we must realize that for our
students, learning English is never an end in itself, but
always a means to an end. We must strive to keep our
students' needs uppermost in mind in determining our
own priorities.
PROMOTING CONCEPT AND LANGUAGE DEVELOPMENT IN THE CLASSROOM

Rodolfo Jacobson, Ph.D.
University of Texas at San Antonio



Introduction

The route to choose in order for a child whose first
language is not English to become truly competitive in a
setting with children who are native speakers of English
is neither the early pullout from a bilingual education
program nor the premature submersion, labelled
"immersion," into an English-only instructional setting.
The answer, in turn, seems to be in the successful
combination of different, but by no means
contradictory, goals: the achievement of the threshold
level in L2 and a stable, well-balanced distribution
of L1 and L2 while conceptual learning takes place. It is
the objective of this paper to elaborate on these two
goals and to report on some early results from the
implementation of a project - the Title VII
Demonstration Project in Bilingual Instructional
Methodology - which precisely seeks to test the
validity of this hypothesis.

Prior Research. The author's research leading to the
instructional design of the Project funded by the U.S.
Department of Education goes back a number of years,
including consulting work and research activities in
Laredo and San Antonio, Texas (1975-82). The findings
that resulted from these activities have appeared in
journals and anthologies, and it may be in order to
briefly summarize these to lend greater credit to the
objectives of the ongoing project.

The author's studies during the last decade have
focused on three topical areas: (a) community verbal
behavior with emphasis on Spanish-English
codeswitching, (b) Chicano English as an emerging
speech variety of American English, and (c) bilingual
instructional methodology with emphasis on language
distributional aspects. The first and the last topic turned
out to be most relevant to the formation of the program
design, since the former determined the extent to which
the two languages could and should be jointly utilized
in the teaching of bilingual children and the latter led to
the development of a bilingual teaching strategy known
as the New Concurrent Approach, which upgrades the
somewhat questionable Concurrent Translation
Approach that underwent some criticism by scholars
like Theodore Andersson, Bruce Gaarder, Christina
Bratt Paulston, and others.

The author's studies on codeswitching comprised "The Social Implications of Intrasentential Codeswitching" (Jacobson, 1977), "Anticipatory Embedding and Imaginary Content" (Jacobson, 1978b) and "Consensus and Divergence in Bilingual Co-occurrence Patterns [...]" (paper presented at the UWM, University of Wisconsin at Milwaukee, Linguistics Symposium, 1978, unpublished). The need to adapt the community switching to and/or limit it for classroom purposes became evident from the very beginning, and the various papers on bilingual methodology emphasize the importance of avoiding any switching inside the sentence and developing a set of strategies designed to make the intersentential switching events purposeful, that is, to render them linguistically, psychologically and sociologically justifiable.



Following the publication of his paper presented at the 1975 TESOL (Teaching English to Speakers of Other Languages) conference, the author has attempted a constant refinement of the language alternation strategy until a satisfactory model would evolve that lent itself to demonstrate the effectiveness of the cited approach. The publications that reflect this gradual emergence of an approach controlling the dual language use of teachers and students show that what is sought here is neither random nor confusing, but rather sociolinguistically relevant as it reflects the norms found in multiple settings where two languages are in continuous contact (Jacobson, 1975a, 1975b, 1978, 1979a, 1979b, 1981, 1982a, forthcoming).

Threshold Level and Second Language Acquisition. The educational research in Canada (Cummins, 1980) has offered abundant evidence for the need of developing the child's home language and helping him/her to reach in it a threshold level that then permits the learner to gain proficiency in the school language. The analogy of the iceberg in attempting to conceptualize the problems of L2 acquisition is by now widely known (Cummins, 1980: 29-36). Equally well known are the acronyms CALP and BICS in picturing the acquisition process controlling the success or the failure of the child expected to learn in school through a language other than his native language (Cummins, 1980: 28-40). Both arguments show the urgency of developing L1 in the child and making him/her literate in it so that the child may transfer to L2 the deeper cognitive skills that he/she has developed in his/her first language.

The question, however, arises whether L2 should be acquired only after the threshold level in L1 has been achieved or whether both languages can be acquired or developed in an almost simultaneous fashion. An L1/L2 consecutive acquisition is hardly viable in a setting like the U.S., where early exposure to English is unavoidable and the learning of the mainstream language is usually equated with mainstreaming and upward mobility. It has been rewarding to see that simultaneous or almost simultaneous acquisition is now also considered a possible option even for the Canadian setting (Cummins, this volume). This author has been certain for some time now that such a near concurrence of L1 development and L2 acquisition is most feasible and even necessary for settings where the pressure of learning L2 is as formidable as in the U.S. In other words, the home language can be developed effectively through dual language learning tasks if one keeps in mind that the threshold level in the first language must eventually be reached.

¹ The difference between the NCA method and its parent version, the Concurrent Translation Approach, is still poorly understood. The opposition to the latter is merely transferred to the former without further study. See Christina Bratt Paulston's introduction to a collection of papers delivered recently in Racine, Wisconsin (Valdman, Hartford, Foster, 1982: XIII).



As this latter objective is being pursued, the language of the majority is also learned, producing in the child an early bilinguality that then permits the parallel development of his/her two languages. This early bilinguality, however, is not only the result of the child's exposure to two languages and his/her dual language development but is also due, to a very considerable extent, to the notion of stability in language distribution that characterizes the truly bilingual person. As a matter of fact, the balanced distribution of his two languages, that is, the use of as much L1 as L2, denotes equal language status or prestige. Hence, any L2 increase in the bilingual classroom that is accompanied by a decrease in the use of L1, however gradually this may be realized, will always be interpreted as a "put down" of one's vernacular, a situation that will ultimately lead to the apparent or even the actual loss of the home language.

Examine the following table that illustrates the instructional design observed in the K level of the cited Federal Project. Regardless of its identification as comparison or treatment classroom, the amount of Spanish used to teach the children is such that the threshold in that language can eventually be reached. On the other hand, only the treatment kindergarten demonstrates the stability of language distribution discussed before (see above), as its ratio is a steady 90% to 10% in favor of the home language. The 10% of second language instruction, furthermore, is restricted there to language arts (ESL). The comparison kindergarten, in turn, shows the decrease-increase pattern as the ratio of 90%:10% in favor of Spanish at the beginning of the year is gradually changed to a 75%:25% ratio. Within this 25% of English, not only language arts but also math, science or health can be taught. This introduces the child to the trend in conventional bilingual education where the teaching in the vernacular is decreased as the teaching in English is increased. It also introduces the child to the separation of the two languages on the basis of school subjects, as is found in every traditional program.



Table 1

             Comparison                  Treatment

Spanish      90-75% of time              90% of time

English      10-25% of time              ESL (10% of time
             (lang. arts & content)      as lang. arts)





Answers to questions that are addressed in a project implementing such a design are of great importance to any bilingual instructor and likely to provide us with some interesting data as to what does and does not work in bilingual education. Some of these questions might be as follows:

1. Do all the children at the K level eventually reach the threshold level in their home language?

2. Does the decrease-increase pattern in the comparison classes, in effect, produce a feeling of English language superiority and Spanish language inferiority?

3. Does the stability of distribution in the treatment group, in turn, give the child greater security in regard to his/her attitude to English and Spanish?

4. Does the use of English in science-related subjects give the child the impression that only English can handle these areas of content?

5. Is the limited use of English in the treatment classes unrealistic for a child who is expected, in the following year, to receive half of his/her content teaching in that language?

Without any doubt, the kindergarten level presents itself as the ideal stage to set the groundwork for later achievements. It is here that we can develop further the language that the child brings from his/her home and, at the same time, attend to his/her language attitude formation. Since no specific academic achievement is required at the pre-primary level, there is sufficient time not only to use the home language in the classroom, but also to correct any biased perception that the child may hold of the language of the home or any overrating of the majority language that the child may feel to exist. The prejudices that the child does bring to school are rarely deep-rooted and can thus be removed quite easily so that he/she begins his/her primary education with few, if any, psychological problems.



Concept Development through the New Concurrent Approach

The New Concurrent Approach has been defined and explained by the author [...] in forthcoming publications. "The New Concurrent Approach," he argued in a paper delivered at the Ethnoperspectives III Conference in Ypsilanti, Michigan (Jacobson, 1981: 14-29),

    resulted from the author's desire to bring together the child's two languages in a way that would further the latter's language development and, at the same time, lead to satisfactory school performance. To accomplish this objective, he had to address several issues:

    (1) the extent to which the native language must be developed in order to succeed in learning a second language (see above);

    (2) the extent to which the home language should be used in school to develop a positive attitude toward it;

    (3) the extent to which first language maintenance in the primary grades would not interfere with the transition to English in post-primary education;

    (4) the extent to which the use of both languages would lead to an understanding of the bilingual functioning of some sectors of our society;

    (5) the extent to which school subjects would be learned through two language media.

In elaborating on this last issue, the author has emphasized that, in the primary grades,

    School subjects other than language arts utilize concurrent teaching techniques in which the teacher shifts smoothly from one language to the other as the lesson is carried on. This alternation enables the teacher jointly to develop the bilingual child's two languages without jeopardizing his school progress. Thus, concepts are formed or reinforced in both languages; the lexicon is expanded and many other meaningful activities go on in both languages simultaneously.

    (Ibid, 18)

In the actual proposal submitted to the Office of Bilingual Education and Minority Languages (OBEMLA), U.S. Department of Education, the author has attempted to define the approach more concisely as:

    A strategy through which the bilingual teacher teaches the school curriculum (except language arts) in the child's two languages by using both languages concurrently, that is, switching from one language to the other as the teaching/learning situation may require.

"Each switching instance," he continues, "shall be pedagogically justifiable in light of four criteria:

    (1) the two languages are distributed at an approximate ratio of 50%-50%;

    (2) the teaching of content is not interrupted;

    (3) the teacher is conscious of his/her alternation between the two languages; and

    (4) the alternation accomplishes a specific learning goal."

    (Jacobson, 1981: 14)

The intersentential switching for which the author is arguing is based on the utilization of a framework known as the System of Cues. "Cues, in NCA (New Concurrent Approach)," explains the author in "The Role of the Vernacular in Transitional Bilingual Education" (in Hartford, Valdman, Foster, eds., 1982: 183):

    are signals that the teacher identifies in her class and that she wishes to respond to. As an ethnographer seeks to gather all the elements of a social situation and then arrange them in a meaningful order, so the teacher identifies the various aspects, pedagogical and social, that she feels she must address. These elements, or cues, are categorized depending on their relevance to classroom management, language development, curriculum, or teacher-student interactional norms.

Table 2 lists these pedagogical areas and the corresponding cues.



Table 2

System of Cues

(1) Classroom Strategies              (2) Curriculum

    a. Conceptual Reinforcement           a. Language Appropriateness
    b. Review                             b. Content
    c. Capturing of Attention             c. Text
    d. Approval/Disapproval

(3) Language Development              (4) Interpersonal Relationships

    a. Variable Language Dominance        a. Intimacy-Formality
    b. Lexical Enrichment                 b. Courtesy
    c. Translatability                    c. Free Choice
                                          d. Fatigue
                                          e. Self-Awareness
                                          f. Rapport

The planned language alternation, as discussed in the quotes above, is serving as the teaching method in the treatment group of the cited Title VII Demonstration Project. The dual language use in the comparison group, on the other hand, is conventional, as there the two languages are strictly separated on the basis of content, that is, social studies, music, art, and physical education are taught in Spanish, whereas math, science and health are taught in English. It is, in particular, in the treatment group that the close correlation between language development and conceptual reinforcement is observed. As a concept is taught, the teacher assesses the need for further language development, whether in the area of phonology, grammar or lexicon. By the same token, as the language is being developed during content classes, the specific concept to be taught is never disregarded.

As for the switching from one to the other language during content classes, three reasons can be cited that justify the dual language use in class. It is justifiable on pedagogical grounds as it promotes desirable language behavior in each one of the child's languages. It is equally justifiable on psychological grounds as it promotes the status of the home language and places both codes on equal grounds. The distributional balance found in the 50-50 ratio and the utilization of both languages in the teaching and learning tasks cannot but convey the message that nothing is learned in one language that cannot also be learned in the other. Finally, the switching is justifiable also on sociolinguistic grounds in view of the similarity that exists in respect to certain community behavioral patterns that are being observed. Obviously, the difference between intrasentential and intersentential switching is merely one of degree, and as long as alternations do occur, they will impress the child as familiar strategies present in home and neighborhood.

In addition to the theoretical justification on whatever grounds, it must be ascertained that the NCA Method is being implemented successfully. The prerequisites of successful implementation are, however, such that they require special training either in the form of a university course² or as part of inservice workshops. Of four such prerequisites, two have been discussed above when a working definition for the approach was proposed and when the system of cues was suggested as a viable framework for triggering language alternation with a preconceived objective in mind. To be more specific, the author has contended that the success of this method depends on the balanced distribution of the two codes and the ever-present goal consciousness as the teacher responds to the cues that he/she has been trained to identify and to react to.

A further prerequisite could be labeled self-monitoring of dual language use. It is the need for the teacher to be able to detach him/herself from the teaching task and monitor the relative time that he/she stays in either language, as well as the extent to which he/she observes the educational objectives agreed upon in advance. In other words, was as much Spanish as English used in teaching, say, a social studies lesson, and were the language switches actually geared to tasks such as conceptual development, lexical enrichment, the notion of language appropriateness and so forth?
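
The time-monitoring part of this prerequisite amounts to simple arithmetic, which can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the lesson log and the 10-point tolerance around an equal split are all hypothetical and are not part of the project's procedures.

```python
from collections import defaultdict


def language_shares(segments):
    """Return each language's share of total teaching time.

    `segments` is a list of (language, minutes) pairs in lesson order.
    """
    totals = defaultdict(float)
    for language, minutes in segments:
        totals[language] += minutes
    lesson_total = sum(totals.values())
    if lesson_total == 0:
        raise ValueError("lesson has no timed segments")
    return {lang: mins / lesson_total for lang, mins in totals.items()}


def is_balanced(shares, tolerance=0.10):
    """True when every language is within `tolerance` of an equal split.

    The tolerance is an assumed value chosen for illustration only.
    """
    target = 1 / len(shares)
    return all(abs(share - target) <= tolerance for share in shares.values())


# Hypothetical log of one social studies lesson: (language, minutes).
lesson = [("Spanish", 6), ("English", 5), ("Spanish", 4), ("English", 5)]
shares = language_shares(lesson)
balanced = is_balanced(shares)
```

A log of this kind covers only the distributional half of self-monitoring; whether each switch served an agreed objective still has to be judged by the teacher.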

More from the methodological perspective, an effort shall be made to convey continuity in whatever content a lesson is taught. The language alternation must not be disruptive. As a matter of fact, planning an NCA lesson differs from preparing a lesson in a single language only in the language medium, not in the content nor the teaching strategy.

The strict observance of these prerequisites, i.e. (1) balanced distribution, (2) continuity of class performance, (3) self-monitoring of dual language use, and (4) goal consciousness, all contribute to the effective use of intersentential codeswitching in the classroom, allowing for concept development and dual language growth.



Community Behavior in Contact Situations: A Rationale for NCA

It may be in order to elaborate somewhat further on the relationship between the NCA method and language distributional patterns in bilingual settings. Wherever two languages are in continuous contact, three situations may prevail. Only one language is spoken at a time for reasons of language appropriateness or out of courtesy to the L1 or L2 dominant member of a given bilingual community. Accordingly, a balanced Spanish-English bilingual will address interlocutor A in Spanish only and interlocutor B in English only.

More frequent than a unilingual pattern of this nature is the switching of codes, whenever pertinent, such that segments of both languages occur either in the same sentence (intrasentential) or from sentence to sentence or between two discourse episodes (intersentential). To summarize, the bilingual's distributional patterns to use in his/her bilingual community are of three kinds:

(a) unilingual: one language at a time, but not both;

(b) intersentential: one language in one sentence and the other language in the next;

(c) intrasentential: both languages in the same sentence.

Recent research on codeswitching has focused mainly on c, although some data have also been gathered, especially in the author's research, on b (Jacobson, 1977). The following examples from the study "The Social Implications of Intrasentential Codeswitching" (ibid) illustrate these two switching strategies.
Table 3

INTERSENTENTIAL

1. - It takes - Es más despacio la manera esa. (EC-24.14-15) (False Start)

2. - ... I wished they'd come more often! - You ought to get on the phone - y dijo mamá que vinieran a visitar, ¿ves? (EC-3.20-23) (CODE: Initiation of response)

3. - Sabe lo que me gusta a mí ves! Man! That's all kinds of bbers ahtf (EC-25.10) (CODE: Continued speech after switching)

4. - No he podido grabar la conversación. ¿Qué es que dijo?
   - I was unable to record the conversation. (RJ-memory) (CODE: Clarification)

5. - ¿No?
   - Sí. He is going to be training for a manager right now. (EC-6.7-9) (Employment Domain)

6. - Well, you do a lot of P.R. Cuando vienen las mamases muy bien, tienes que calmarlas. (VC-T.3-5) (Culture: Persons)

7. - (Wife to mother) Con la misma chaqueta por cuatro años.
   - (Wife to husband) You are the one wearing the same jacket (EC-7.19-21) (between spouses).

8. - ¿Me aprobó mi sopa? Ah, that's good. Este no es macaroni de la bolsa. (METAPHOR: For contrast)

9. - ..., íbamos allí siempre cada año. We went there every year (VC-6.506) (Metaphor: For emphasis)

INTRASENTENTIAL

1. - And I tell you another thing que I'd shoot body ... (FM-1.14) (Substratum)

2. - I lose my temper porque a mí me da mucho coraje ... (FM-1.15-16) (Emotion)

3. - ... she would tell me things like - este - you know (FM-5.11) (Hesitation)

4. - ... ¿Le borro y le pongo speak? (FM-3.1-2) (CODE: Prior code use)

5. - Ah, no - it's not a sound problem, it's more of a like como dónde acentúa uno la palabra. (VC-12.9-10) (CODE: As topic)

6. - ... si no 'biera ella dicho eso, we would take for granted that it was the last Saturday? (CODE: Anticipatory embedding) (Specialized terms)

7. - "How long have you been here?" Pos le decía "twenty-nine, thirty years." (EC-10.1-2) (CODE: Quote)

8. - ... un hombre precavido vale por dos. Ahora digo yo si toca lo de malas, OK but I did what I could to prevent it. (FM-25-7) (CODE: Precoming)

9. - Mis sobrinas are the typical - you know - they can understand it, ... (FM-4.12-13) Home/Family (DOMAIN)

10. - Oh, sí. Sí porque I notice que if I write something down, I can remember it better. (ED-28.7-8) (School Domain)

11. - It was the day you went al parque. (R-1.6) (Culture: Environment)

12. - Cuando comenzó esto todo lo que andaban mechudos, they believe that they were no good ... (EC-9.16-18) (CULTURE: Attitude/Bias)

13. - Los doctores que vienen de México igualmente hacen lo mismo after being here for a while. (FM-6.10-11) (CULTURE: Language-Locales)

14. - I went only for one sole reason to Mexico - porque no me contaran cómo era. I went when I was twenty a la capital. (FM-9.22-23) (CULTURE: Heritage)

15. - Ah ... bueno, como me estás grabando - I'll take the fifth amendment - on that one. (VC-1.3-5) (CULTURE: Social/Political Institution)

16. - And maybe it's part of the culture, too, because, you know, like con nosotros "¿qué?" you know, se oye muy mal. I mean, en mi casa todo el tiempo es "mande." (VC-13.19-21) (CULTURE: Language)



Given the presence of both strategies in our community and one of them being acceptable on educational grounds, this language alternation strategy, if transferred to the classroom, could accomplish one of the most cherished but also least attained goals, not only in bilingual education, but in all education settings: that of bringing closer together the school and the community. Hence, the use of intersentential codeswitching in the NCA class makes an important contribution toward balancing two communicative competencies, that of Spanish and English, both of which must be brought to a common denominator if one wishes to improve the self-image of the bilingual minority child.

The Long-Term Objectives of the Demonstration Project

One can identify in the cited project at least six distinct strands or objectives which, when interrelated with one another, represent the long-term goal of the Title VII Demonstration Project in Bilingual Instructional Methodology. More specifically, the author has attempted to address in it issues that can be identified under the following headings:

(1) Methodology,

(2) Ethnography,

(3) Pedagogy,

(4) Education in the broadest sense,

(5) Network of Information,

(6) Evaluation.

Methodology is addressed in the sense that the potentials of the NCA Method are investigated. On the other hand, before conducting the teaching of content through alternation between the child's two languages, the teacher must come to grips with a series of related aspects. How is the child in this project assisted in reaching the threshold level in his/her home language? What is the child's attitude to the home and the majority language as he/she strives for the threshold level? How does the child envision the future role of his/her two languages? The latter question obviously relates to the equal or subordinated status of L1 as an overall means of communication.

The rapprochement between school and community discussed earlier is obviously a concern of ethnography if the investigation of parental language attitudes and patterns is included here. The acknowledgement of the importance of ethnography of communication (Hymes, 1974) as a scholarly field of study and its incorporation into this project points to the fact that the children's attitudes to language in general, and their two languages in particular, can only be developed in school if support in this respect is also found in the homes and the immediate community.



The training of the teaching staff is a necessity in a project as pedagogically demanding as this one. Pedagogy comprises two different phases of the personnel's intellectual growth, inservice training and continued education. In the former, teachers and aides are acquiring the specific expertise to successfully implement the teaching method to which they may have been assigned; in the latter, they are upgrading their professional education at the undergraduate or graduate level, whichever may apply. The advantages of providing training and college education to the participating teaching staff are many, some benefiting the teachers and aides, others the project itself. As the teaching staff become more knowledgeable, they also contribute more effectively to the project.

The impact of the cited project on the schools, the school district and the immediate community is a noteworthy aspect and addresses education in its broadest sense. More than merely accepting the project per se, one deals here with a much broader issue: the feasibility of bilingual education for minority children, as long as the ingredients of such a program are comparable to those observed in this project. Also, is the implementation of a project of this nature actually leading to the institutionalization of the approach on a wider scale? As for the district under consideration here, this impact is already recognizable as additional bilingual classrooms are being established with local funds, and enrollment figures for the project grades are showing far greater stability than what is usually noted in comparable grades of the two schools.

Finally, for the information on the project to become available to the interested educator, a network has been conceived that will allow participants to share the experiences being gained in the Demonstration Project. Within this general framework, the awareness and the demonstrability phases are designed to release information on its progress locally and also nationally. In addition, the project in operation shall be made available for site visitations in the hope that these may ultimately lead to the replication of the NCA model on school campuses where instructional innovation in the field of bilingual education is currently sought.

Conclusion 


It is expected that incoming data during the following years of the grant period will provide hard evidence on the greater success of second-language acquisition when the first language is also being developed and a comparable success in conceptual learning when bilingual children are made comfortable with the presence in their lives of two languages and two cultures, as they grow up and learn to adjust to their roles in a mainstream setting.
BILINGUALS: MORPHOLOGY VS. SYNTAX

1. Introduction

The most immediate aim of this paper is to explore and discuss the relationship between the spontaneous and test speech of a sample of 10-12-year-old Hispanic bilinguals. This aim, in turn, serves a larger purpose of interpreting the implications of the findings for the testing of oral speech and the projection of school achievement in literacy.

Certain questions of relevance to educational concerns are involved. In somewhat direct form, the two most prominent questions are:

1. What does an oral language test tell us about what a speaker knows and about what can be used for further language learning?

2. What role does knowledge of oral language play in learning to read and write?

As the title suggests, the answers to both of these questions depend on distinguishing various components of language (components shared by both spoken and written forms of language). The most fundamental components of language which carry meaning and yield to precise analysis are morphology and syntax. To anticipate later discussion, the distinction between morphological and syntactic behavior will reveal that the theories of language acquisition which have emphasized language development up to the age of six have underestimated the complexity of the purposes which syntax serves and how knowledge of syntax is integrated into increasingly complex demands made on communication in the course of the social development of children into adults, whether or not literacy is part of this development. Syntactic behavior, in particular, will be shown to have relevance to certain widely known hypotheses concerning the relation of bilingualism to literacy achievement among lower SES minority groups, hypotheses generated from information grounded exclusively in test language behavior.

2. Basic concepts 

The project reported on here was designed to investigate the influence of situation and topic on the language behavior of 10-12-year-old Spanish-English bilinguals who are from lower-than-middle socio-economic backgrounds in a highly concentrated Mexican-American community in Los Angeles. One of the overall objectives of the study was to gain insight into which aspects of each language were most influenced in quality and/or quantity by changes in situation.

It is well-known to sociolinguistic research that language behavior changes according to situation, but that not all aspects of language change in the same way (cf. Wald, 1980, 1981 for reviews of sociolinguistic studies of the relation of language behavior to situation). Accordingly, we reasoned that, to the extent that language proficiency tests are intended to measure behavior representative of the actual language abilities (knowledge, competence, or resources) of speakers in communicative situations, some tests may be more appropriate than others for this purpose, depending on the particular aspects of language which are emphasized in their test designs. For example, among the language proficiency tests widely used for classifying students as limited or fluent, the BSM (Bilingual Syntax Measure) and the BINL (Basic Inventory of Natural Language) are virtually diametrically opposed in their emphases (see Herbert, 1979; Burt & Dulay, 1976):

1. BSM: emphasizes morphology, i.e. word-composition, particularly the use of inflections, e.g. fall + Past = fell.

2. BINL: morphology is intentionally ignored in favor of sentential syntax, i.e. the organization of words into sentences. The BINL measures mean length of utterance and syntactic complexity. The use of subordination (the adjoining of additional clauses to some element of the main clause of a sentence) is highly valued, e.g. the use of the relative clause, as in: there was a silly old monster who liked to drink pink lemonade.
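
Mean length of utterance, one of the two quantities the BINL is said to measure, is an average and can be illustrated with a short Python sketch. The sketch assumes that length is counted in whitespace-separated words; it is not the BINL's own scoring procedure, and the function name and the two short utterances in the sample are hypothetical.

```python
def mean_length_of_utterance(utterances):
    """Average number of whitespace-separated words per non-empty utterance.

    Counting words rather than morphemes is an assumption made for
    illustration; a published instrument may count differently.
    """
    lengths = [len(u.split()) for u in utterances if u.strip()]
    if not lengths:
        raise ValueError("no utterances to measure")
    return sum(lengths) / len(lengths)


# The first utterance is the relative-clause example quoted above;
# the other two are invented for contrast.
sample = [
    "there was a silly old monster who liked to drink pink lemonade",
    "he was big",
    "I saw him",
]
mlu = mean_length_of_utterance(sample)
```

A single subordinated utterance raises the average considerably, which is consistent with the text's remark that subordination is highly valued by a measure of this kind.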

In discussing the language behavior of the speakers, it
is useful to define and distinguish test and spontaneous
speech as follows:

Test Speech: speech elicited by an oral language
proficiency assessment instrument following the
instructions for administration of the instrument. In the
present study, the Spanish and English versions of parts
of the BSM-1 and the LAS-1 (Language Assessment
Scales) production task were used (see De Avila &
Duncan, 1977).

*An earlier version of this paper was presented at the Hispanic Research SIG of the AERA in New York City on March 20, 1982. I would like to thank
Jim Cummins for comments on the discussion of his ideas. The research reported was funded by NIE through the National Center for Bilingual
Research. However, this acknowledgement does not imply any agreement or endorsement of anything contained in this paper by the funding
sources.

Spontaneous Speech: speech produced in peer 
situations where speakers were free to initiate and/or
reject topics, and to speak in whichever language they 
preferred (during the first hour). 

Each type of speech is situationally distinct but both 
types reflect aspects of each speaker's total language
resources. 

At issue is the correspondence between the two types
of speech, and subsequently the relevance of the
correspondence for educational purposes, in particular
the achievement of literacy.

Below we consider, in turn, certain aspects of
morphology and syntax.



3. Morphological behavior

For purposes of display, the speakers will be divided by
age of arrival into three groups. Due to the narrow
age-range of the sample, 10-12-year-olds, age of arrival
is interconvertible with length of residence and grade of
entry into the U.S. school system. The three
age-of-arrival groups will be early (age
of arrival 0-5 years, mostly second generation), middle
(age of arrival 6-8 years), and late (age of arrival 9+
years). This grouping is a heuristic found to correspond
closely to the language preference of the speakers in
spontaneous speech, so that English was preferred by
earlier arrivals, and Spanish was progressively
preferred by later arrivals (cf. Wald, 1981a).

Figure 1 below displays the percentage of speakers in 
each group using the indicated morphological form in 
response to the BSM sections designed to elicit the
standard versions of the features. The figure only
displays the speakers who responded to the item in
either a standard or non-standard form, not those who
did not respond at all.



Figure 1

Percentage of Speakers in Each Age of Arrival Group
Using the Indicated Morphological Form in Response
to the BSM-1 English Proficiency Test

[Figure not reproduced. Vertical axis: percentage of
speakers using indicated forms. Horizontal axis: age of
arrival group. Forms plotted, with N per group: ate
(N=21/6/7), wants (N=20/7/3), fell (N=21/5/6), would
have (N=22/8/6).]

Figure 1 shows a gradient pattern, indicating that
speakers with earlier ages of arrival (or longer lengths of
residence) more frequently use the morphological
features tested. Such a pattern is familiar to many
studies correlating various aspects of language
acquisition with length of residence (although there are
studies which show that length of residence is an
important indicator of linguistic development only for a
while, about 4-5 years, among adults and adolescents,
e.g. Heidelberger, 1978; Snow & Hoefnagel-Höhle, 1978).
The implication is that the use of these features reflects
the acquisition of the features by the speakers.

Similar gradience is found for the speakers in 
spontaneous speech. For example, Table 1 below gives 
the unweighted average (average of individual means)
of each group for English subject-verb agreement (the
suffixation of -s to a present tense verb, with a third
person singular subject, e.g. she looks vs. they look). 
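
The difference between an unweighted average (the mean of the individual speakers' rates) and a pooled rate (all possible contexts combined) can be sketched as follows. The speaker counts here are invented for illustration and are not data from the study; the function names are likewise only illustrative.

```python
def unweighted_average(speakers):
    """Mean of each speaker's own rate; every speaker counts equally."""
    rates = [used / contexts for used, contexts in speakers]
    return sum(rates) / len(rates)


def pooled_rate(speakers):
    """Rate over all contexts combined; talkative speakers weigh more."""
    total_used = sum(used for used, _ in speakers)
    total_contexts = sum(contexts for _, contexts in speakers)
    return total_used / total_contexts


# Hypothetical group: (agreement marked, possible contexts) per speaker.
group = [(45, 50), (5, 10)]
print(unweighted_average(group), pooled_rate(group))
```

The unweighted average keeps a single talkative speaker from dominating the group figure, which matters when the number of possible contexts varies widely from speaker to speaker.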




On the whole, correspondence between test and 
spontaneous speech preserves the gradience by age of 
arrival and length of residence. This indicates steady 
development of the morphological process with length
of residence.

A similar correspondence is found with the use of
irregular (strong) past tense forms of verbs in
spontaneous speech (represented in BSM test speech
by the pasts of the verbs eat and fall). The LAS story
retelling presented a greater number of possible 
contexts for strong pasts than the BSM. Table 2 below 
shows the closeness of fit between test and 
spontaneous speech for speakers whose use of the 
strong past was not fully developed in spontaneous 
speech. 

Table 2 indicates that as the number of possible
contexts in test speech increases, the percentage of use
converges with the percentage of use in spontaneous 
speech. At 4-16 tokens (elicited by the LAS story 
retelling), test behavior in irregular past use 
corresponds to spontaneous behavior with 86% 
accuracy. 

So far, the correspondence between morphological
behavior in test and spontaneous speech looks good.
The gradient patterns in both situations indicate that
the tests are tapping actual developmental patterns. For
these speakers and the communities they represent,
English morphology and its measurement in test
situations are relatively unproblematic.

At this point, we turn to a syntactic feature which 
shows a different pattern and requires a different
explanation.



TABLE 1

Unweighted Average of Subject-Verb Agreement for the
Three Age of Arrival Groups in Spontaneous Speech

Age of Arrival   Average   N Speakers   N Possible Contexts
0-5              99        22           648
6-8              70         7           135
9+               55         3            37

(Table includes only speakers with at least 5 possible contexts each.)



TABLE 2

Correspondence between Test and Spontaneous Speech for
Speakers Showing Variation in the Marking of Irregular Past Tenses

Test          Range             Average N          Average % difference
              (N of contexts)   contexts/speaker   from spontaneous speech
BSM           2-4                2.0               31%
LAS           4-15               8.2               15%
Spontaneous   5-128             48.6               -
4. Syntactic behavior

The most crucial difference between syntax and
morphology is in the difficulty of specifying possible
contexts for complex syntactic features. Possible
contexts in morphology tend to be straightforward and
are specifiable in terms of linguistic contexts alone. An
example given above is English subject-verb
agreement. On the other hand, complex syntax, which
involves more than one clause in a single sentence, is
not specifiable in this way. Various possibilities often
exist for integrating a number of clauses into spoken
discourse. To use the example to be discussed at length
below, the information contained in a relative clause
can also be expressed in a main clause without
violating any known linguistic norms of any form of
spoken or written English or Spanish. Both of these
languages show a great deal of surface structural
similarity for this and many other features of syntax.
Example (1) below shows relative clauses used in both
languages in test speech.

(1) a. It was about a silly old monster who useta
       like m pink lemonade. (JS 12m)
    b. Se trata de una giganta que le gustaban
       los helados de-de fresas. (EG 12f)
    (Parentheses following the citations identify
    the speaker by code name, age, and sex, in that
    order.)

The same content can also be expressed in two 
independent clauses, as exemplified by the test speech 
in (2) below. 



(2) a. Once there was ... a big monster n he liked
       ... lime ice cream. (YL 11f)
    b. ... habia una giganta ... y -este- le gustaba
       las fresas. (JS 12m)



In measuring syntactic complexity, testers tend to
disregard the problem of possible context in favor of
scoring for frequency of use of all syntactic
constructions. Complex syntax is highly valued by tests
concerned with syntax, so that the examples in (1)
contribute more to a syntactic score than the examples
in (2) above.



Following this tradition, Figure 2 below displays the
percentage of speakers in each group who used a
relative clause structure at least once in the English
retelling of the LAS-1 English story. The LAS-1 stimulus
itself contains three instances of the relative clause
structure.





Figure 2

Percentage of Speakers in Each Age of Arrival Group
Using the Relative Clause Structure At Least Once in the
Retelling of the LAS-1 English Story

The striking feature of the pattern is the dip in the
middle group, those speakers first exposed to a
predominantly English environment at ages 6-8 years
and entering the school system between first and third
grade. Since we have divided the sample of continuous
ages of arrival into three groups, it is convenient to refer
to this pattern as the pattern of the depressed middle
group.

The pattern is striking because we might have expected
to find a gradient pattern suggesting that age of arrival
(or length of residence) monotonically influences
development and use of relative clauses in English. The
assumption that leads to this expectation implicitly
equates use in test speech with acquisition and
development of the feature. At first glance, this seems
reasonable if we transfer our expectations from
morphological features of test speech discussed in the
preceding section. Although our data on spontaneous
speech allow us to investigate this assumption
directly, it is useful to try to explain the pattern in terms
of a number of recent and current hypotheses which are
limited to test language behavior. It will turn out that
these hypotheses are inadequate. Nevertheless, they
are worth considering because they have been
influential in the recent educational research on
bilingualism, and because the observations on which
they are based must be explained in some way. These
hypotheses are discussed immediately below.

5. Cross-Language Hypotheses: Development and Transfer

One of the crudest hypotheses that can be brought to
bear on the depressed middle group pattern is
Skutnabb-Kangas and Toukomaa's semilingual
hypothesis. Based on a study of tested Swedish
language skills of Finns in several Swedish primary
schools, Skutnabb-Kangas and Toukomaa conclude
that middle age of arrival Finns, especially those
arriving between the ages of 7-8 years, are less likely to
develop language skills equivalent to monolingual
Swedes than either those Finns who arrived before the
age of 6 or those who arrived at age 10+ years, if
educated only in Swedish after arrival (Skutnabb-Kangas
and Toukomaa, 1976, 75). Skutnabb-Kangas and
Toukomaa's hypothesis implies that the arrivals before
age 6 have an advantage over the age 7-8 arrivals,
presumably because of their headstart in the
acquisition of Swedish, but that those Finns who arrive
at later ages, 10+, are in an even more advantageous
position to learn Swedish, presumably due to the lack of
interruption of the development of their Finnish
language skills and the ease with which those skills can
be transferred to Swedish at older ages.



As quickly pointed out by critics of the hypothesis
familiar with the findings of sociolinguistic research, the
semilingual hypothesis is extremely crude since it
creates the impression in consumers of the theory that
the so-called "semilingual" Finns are less able to
express themselves in either language than
monolingual Finns or Swedes (cf. Brent-Palmer, 1979;
Leap, 1979). Since the Finnish findings are restricted to
tests in academic contexts, extension of the findings to
spontaneous speech is unwarranted.

In a number of influential papers adding original
analyses from Canadian L2 studies, Cummins modified
the semilingual hypothesis in a direction responsive to
the sociolinguistic criticisms, while attempting to
preserve its apparent insights. Particularly in a
reanalysis of a study by Wright & Ramsey of a mixed
batch of L2 English speakers in fifth, seventh and ninth
grade in the Toronto school system, Cummins noted
that when length of residence was held constant, older
students showed a tendency to perform better on
many language skills than younger students (see
especially Cummins [year illegible]). For example, Cummins
observes that, as a group, students arriving at the
adolescent ages of 14-15 learned more vocabulary in
one year, as measured by a picture vocabulary test,
than students arriving at ages 4-5 learned in 7 years (op.
cit., 146). This leads to the conclusion that older
speakers are learning the same skills at a much more
rapid rate.



Cummins further noted that not all language skills favor
older over younger speakers in rate of development. For
example, language skills such as native-like
pronunciation, fluency, and oral comprehension appear
to favor younger speakers, according to the studies
Cummins reviews, while vocabulary, sound
discrimination, morphology, and the aspects of syntax
implicit in the test items appear to favor older speakers
(cf. Cummins, 1980). Using the criterion of whether rate
of development favored younger or older speakers,
Cummins suggested that language skills favoring older
over younger speakers are different in kind from the
other language skills and that they are more relevant to
school achievement (Cummins, 1980, 177ff).

By positing differences in relevance of different aspects 
of language to academic achievement, measured by 
test performance, Cummins constrains the semilingual 
issue more clearly than Skutnabb-Kangas & Toukomaa.
However, unclarities still remain about precisely what
skills are most relevant to school achievement, and,
even more importantly, why this is so. This point will
recur as discussion proceeds. 



Cummins reformulates Skutnabb-Kangas & Toukomaa's
hypothesis with much greater precision in the
interdependence hypothesis. According to this
hypothesis, those skills which are most relevant to
academic achievement can be transferred from L1 to L2
at a rapid rate as the speaker approaches adolescence.
On the other hand, Cummins suggests that, for speakers
from minority populations historically discriminated
against and concentrated in subordinate statuses in the
society represented by the educational system
(conditions fulfilled by both the majorities of the Finnish
community in Sweden and the Mexican-American
community among others), if students do not acquire
the relevant skills in L1 first, they will take much longer
to develop in L2, and consequently the student will fall
further behind grade level (cf. especially Cummins,
forthcoming, 12ff).

The interdependence hypothesis has two basic
features: development and transfer. If this hypothesis is
applied to the pattern of Figure 2 above, the simplest 
interpretation would be as follows: 

Both extreme groups (i.e. the early and late groups) 
show similar relative clause use because they have 
similar relative clause development. The early group
shows relatively advanced English relative clause 
development because early acquisition of English has 
allowed sufficient time for the structure to develop. The 
late group shows English relative clause development 
comparable to the early group because its speakers 
have had sufficient time to develop the structure in 
Spanish and have been able to transfer that skill 
relatively quickly as their English develops. 

Following this logic, the depressed position of the
middle group reflects a disruption of Spanish syntactic 
development without compensation in English 
development, due to a relatively late age of arrival and 
short length of residence compared to the early group. 

6. Group Cross-Language Comparison

If the interdependence hypothesis is taken literally, we 
would expect to find a patterning of relative clause use 
across groups in Spanish similar to the English one.

Figure 3 below superimposes the pattern of relative
clause use in Spanish on the English pattern already
seen in Figure 2 above, for the same speakers. The
Spanish data comes from the Spanish version of the 
LAS story retelling task. The Spanish LAS stimulus 
story is identical to the English LAS story in all relevant 
respects, up to the use of the same three stimulus 
relative clauses in their Spanish guises. 

Unlike the English pattern of the depressed middle 
group, the Spanish pattern shows the gradience usually 
associated with monotonic development. 

While the comparison of the Spanish and English 
patterns does not confirm the interdependence 
hypothesis, it does not disconfirm it either. For example,
one might argue as follows:

The early group shows equivalent development in
Spanish and English, but the later groups show more
development in Spanish. In this case, the problem of
transfer is restricted to those who have developed the
relative clause in Spanish, but not in English. One might
propose that those who have developed the relative
clause in Spanish, but not in English, have not
developed their English well enough to effect transfer.

This still makes the middle group look the most
underdeveloped, since they show relatively little
difference from the early group in apparent Spanish
development, along with a relatively depressed
development in English. Considering the longer length
of residence of the middle group, along with the
apparent depressed development of their English skill
compared to the late group, the middle group appears
not only to be relatively lacking in development, but
also in ability to transfer across languages. Finally, the
apparent lack of development of many members of the
early group in either language makes them look inferior
to the late group, and only superior to the middle group
in English.



Figure 3

Percentage of Speakers in Each Age of Arrival Group Using the
Relative Clause Structure At Least Once in the
Retelling of the LAS-1 Stories in the Language Indicated

7. Individual Cross-Language Comparison

The finer resolution of Table 3 below alters the 
perspective of Figure 3 to some extent. 

Here we see the actual match of use of the relative 
clause in English and Spanish for the individuals in each 
group. 

The interdependence of relative clause development in 
Spanish and English is now seen to be very unclear for
the early group. Very few speakers show use of the
structure in both languages. Most use it in only one of
the two languages without any appreciable difference
of likelihood in either language. On the other hand, the
middle and late groups preserve the impression that the
relative clause must develop in Spanish before it
develops in English. This allows us to continue to
maintain that for non-early English speakers, the
structure develops first in Spanish and is then
transferred to English, rather than independently
developing in English regardless of Spanish
development.

The depressed middle group pattern is reflected in the
proportion of members of each group who used the
structure in neither language. The middle group shows
the greatest proportion of individuals failing to exhibit
development in either language. This is followed by the
early group, which shows less use than the late group in
at least one language.

8. Syntax: Test Speech Compared to Spontaneous Speech

Table 4 below now adds the situational variable to 
relative clause use of the students. It compares use of
the structure in test speech, discussed above, with its
use in spontaneous speech. 

The results reveal serious discrepancies between
observed performance in spontaneous speech and in
test speech. Only two speakers show no evidence of the
relative clause in any observed context. Both of these
speakers were members of the early group. They talked
relatively little in either language. For them, as for all the
others, we cannot assert that failure to use the relative
clause in either language implies failure to have
developed the structure. All other speakers had
developed the structure in at least one language. Thus,
the patterns of use reflected in the test speech are not
reliable indicators of spontaneous speech for this
syntactic structure. Most importantly, the patterns of
test speech are not accurate indicators of the linguistic
knowledge and resources underlying the use of this
syntactic structure in either spontaneous or test speech.

These results immediately belie any attempted
semilingual hypothesis in explanation of the test
speech patterns. For the interdependence hypothesis,
the question becomes not only the problem of transfer
across languages, but of transfer across situations.
[...] group speakers used the relative clause in English
spontaneous speech, only one of them transferred this
use to the test situation.



TABLE 3

Comparison of RC Use in Spanish and English Story-Retelling by AOA

AOA    Spanish Only   Both   English Only   Neither   N
0-5    .27            .22    .22            .27       (22)
6-8    .43            .14    .00            .43       (7)
9+     .57            .43    .00            .00       (7)
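
As an arithmetic check, the Table 3 proportions can be collapsed into the share of each group using the relative clause in each language in test speech (the "only" column plus "both"). The sketch below uses the table's values; the variable and function names are illustrative and not part of the study.

```python
# Table 3 proportions per age-of-arrival group:
# (Spanish only, both, English only, neither)
table3 = {
    "early (0-5)": (0.27, 0.22, 0.22, 0.27),
    "middle (6-8)": (0.43, 0.14, 0.00, 0.43),
    "late (9+)": (0.57, 0.43, 0.00, 0.00),
}


def language_totals(row):
    """Return (Spanish use, English use) as 'only' + 'both', rounded."""
    sp_only, both, eng_only, _neither = row
    return round(sp_only + both, 2), round(eng_only + both, 2)


def is_monotonic_increasing(values):
    """True if each value is at least as large as the one before it."""
    return all(a <= b for a, b in zip(values, values[1:]))


spanish = [language_totals(row)[0] for row in table3.values()]
english = [language_totals(row)[1] for row in table3.values()]

print(spanish, is_monotonic_increasing(spanish))  # [0.49, 0.57, 1.0] True
print(english, is_monotonic_increasing(english))  # [0.44, 0.14, 0.43] False
```

The Spanish totals rise with age of arrival, while the English totals dip in the middle group, which is consistent with the contrast between gradience and the depressed middle group described in the text.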



TABLE 4

Comparison of Use of RC in LAS-E and -S Story-Retelling and
Use of RC in English Spontaneous Speech

                                      RC use in LAS
                          Sp. Only   Both   Eng. Only   Neither
Percent of Speakers       .62        .55    1.00        .78
using English RC in D1
(N)                       (13)       (9)    (5)         (9)
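
The speaker counts behind Table 4 can be approximated by multiplying each proportion by its N and rounding. This is a reconstruction from the rounded proportions, not a set of counts reported in the paper, and the names used are illustrative.

```python
# Table 4: category of RC use in LAS test speech ->
# (proportion using English RC in spontaneous speech, N speakers)
table4 = {
    "Spanish only": (0.62, 13),
    "Both": (0.55, 9),
    "English only": (1.00, 5),
    "Neither": (0.78, 9),
}

# Reconstructed number of speakers using the English RC spontaneously.
counts = {label: round(p * n) for label, (p, n) in table4.items()}

# Speakers with no RC in either test retelling and none in spontaneous English.
non_users_in_neither = table4["Neither"][1] - counts["Neither"]

print(counts)
print(non_users_in_neither)
```

Under this reconstruction, 7 of the 9 speakers who used the relative clause in neither test retelling did use it in spontaneous English, leaving the two speakers the text describes as showing no evidence of the structure in any observed context.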



9. Discussion 

This final section discusses the implications of the
behavior discussed above for relevance to educational
concerns.

A. The use of test speech without control of possible
contexts for complex syntax leads to an unreliable 
measure of the actual language resources of the 
speaker tested. 

First, the possibility of using complex syntax may
depend on how much the speaker chooses to say.
Setting aside actual language resources, one
cannot safely assume that all students approach
the same task of story retelling or picture
description with the same interest or enthusiasm.

We may expect syntax to be richest when the 
speaker has greater access than the tester to the 
information to be expressed in speech (cf. Guthrie & 
Steffersen, 1980). To the extent that syntactic 
organization reflects the speaker's evaluation of
how information should be presented (e.g. by a
subordinate or independent clause), the speaker's
additional task of guessing which information is
most important in the test situation (communicating
to a tester who may be assumed to already
know the information) complicates the process of
formulating a test response. The speaker must not
only evaluate which information to present, but
how to frame that information in some syntactic
form.

The lack of control of possible context for relative
clause vs. independent (or conjoined) sentence, as
in (1) and (2) above, can be contrasted with the
control of possible context within narrower, more
rigorously defined syntactic contexts. For example,
in the literary standards of both English and
Spanish, relativization on the object of a preposition
allows the preposition to precede the relative
marker (literary Spanish requires it while literary
English is more flexible in allowing alternation
between the two syntactic types), as illustrated in
(3) below:

(3) a. the car about which I told you.
    b. el hombre de quien/que (yo) estoy
       hablando.

However, in spontaneous speech from all speakers
[...]
the constructions represented in (3) were never
observed. The equivalents are presented in (4)
below.

(4) a. the car (that) I just told you about (Al 12f)
    b. el que yo estoy hablando (OM 10m)

In the speakers' spontaneous English, the
preposition was always left stranded. In the
equivalent Spanish, it was always unexpressed.

Because relativization on a prepositional object
is strictly defined in terms of linguistic elements,
the possible context for the literary construction is
clear. Speakers' use of the types in (4), but not in (3),
indicates that the spoken versions of English and
Spanish used by the speakers differ from the
standard literary versions of both languages for this
feature.

In contrast, the possible contexts which include the
relative clauses of (1) and the adjacent main clause
pairs of (2) cannot be strictly defined in linguistic
terms. No matter how many adjacent main clause
pairs we find, we cannot conclude that the speaker
has not acquired the relative clause structure.

It is possible to create favorable contexts for 
eliciting the relative clause structure, e.g. by asking
"what's a teacher?" (expected answer: somebody
who teaches, cf. Chiang, 1980). However, the
holistic measurement of syntactic development
would require a large number of specially designed
discrete-point items. Until proficiency testing has
decided precisely what points of syntax it wants to
measure for what purposes, it is unlikely that such a
test will be widely used in evaluating students'
language proficiency for academic purposes. But
until such decisions are made, language proficiency
testing for syntax continues to run the risk of
seriously underestimating the student's linguistic
resources and supporting programs which retard 
the student's academic progress in either or both 
languages. 

B. The transfer aspect of the bilingual inter- 
dependence hypothesis is complex. It applies not 
only to transfer across languages, but also to 
transfer within a single language across situations.

From the perspective presented here, the primary 
value of the interdependence hypothesis is that it 
recognizes that different aspects of a language 
develop independently and are differentially related 
to literacy achievement. In assigning special 
attention to features of language which show a 
more rapid rate of development in L2 for older 
speakers, it has potential for contributing to the
development of rational, well-motivated programs
for teaching oral- and literacy-related skills to
limited-English speakers.

One case where it may be justified in its
assignment of differential importance to aspects of
language for the purpose of literacy is in its
discounting of pronunciation as an important
literacy-related skill. The common observation that
native-like pronunciation of L2 is developed more
rapidly by younger speakers implies that it is not
among the most important literacy-related skills,
according to the hypothesis. This has direct
implications for the procedures used to teach
reading. Convergent evidence comes from a
microethnographic study reported by Moll (1981).
Moll discusses a group of sophisticated third grade
readers able to draw inferences from their Spanish
reading material. Drawing inferences is a high-level
reading skill. At the same time, in a totally
independent English reading class, the same
students are only required to practice
pronunciation and to perform low-level decoding
tasks. The implication is that these students, who
are capable of much more demanding tasks in
English reading, are held back by a methodology
which places unduly heavy emphasis on phonics
and narrow decoding skills, as if reading skills in the
two languages were independent of each other.
The issue of unduly emphasizing pronunciation and
even of misinterpreting pronunciation differences
between standard and nonstandard varieties of a
language as decoding errors has been identified by
Labov (1972) for teachers of students speaking
black English vernacular. Thus, pronunciation
represents a case of convergence between the
predictions of the interdependence hypothesis and the
observations of scholars concerned with
misinterpretations of language behavior in
instructional situations.



However, the findings reported in this paper 
indicate that conclusions about the role of complex 
syntax in literacy achievement may be of a quite 
different order. The present case of the relative 
clause suggests that many tests do not simply test 
the language resources of students, but also their 
test-taking abilities. The relative clause, a feature of 
high value in the test-taking situation, is commonly 
used in spontaneous speech by virtually all
speakers, but reflects a pattern associated with
literacy-related language skills in test speech. It is,
therefore, not a straightforward task to separate
aspects of L2 and claim that they have a faster rate
of development for older speakers without adding
"in a test situation".

The evidence suggests that syntactic behavior is
very much influenced by features related to the
school situation, both in testing and achieving
literacy. It follows that difficulties in the acquisition
of literacy and in language proficiency testing
proceed not only from lack of language
development, but also, and perhaps more
commonly, from difficulties in transferring already
developed language resources to academic
situations. Thus, the route to improved literacy
achievement must include identification and
academic development of resources which the
speaker already possesses. How to do this will be
labelled the transfer problem.

Production and comprehension of complex syntax
is a literacy-related skill. The relation between 
syntactic complexity in vernacular and literate 
standard languages needs to be recognized in order 
to solve the transfer problem. 

Complex syntax contributes to the organization of
information in spoken discourse and written texts.
The written language has a highly developed
hierarchical information structure which, in its full
form in the school textbook, includes chapters,
sections, subsections, paragraphs and sentences.
The sentences include those with more than one
clause, either conjoined or subordinate.
Recognition of this organization and ability to use it
for information retrieval is a high-level goal of
reading instruction. It is used, for example, for
finding passages in a text which support inferences
made beyond the actual text, as well as for locating
information actually expressed in the text.
Complementarily, a high-level goal of writing
instruction is to provide students with the
knowledge necessary to reproduce this information
structure (e.g. in the extended book report).

The relation of the organization of spoken to written 
language is the subject of some current research.
Evidence is beginning to accumulate that there are
differences between some of the devices used to
organize information in the spoken language of
some lower SES communities and the standard
written language, and that primary school students
transfer these spoken devices to the written
language in violation of the standard literary norms
(cf. Michaels, 1981). 

At the same time, the unqualified assertion that
written language is less contextually dependent
than spoken language has been challenged for
some genres of speech and writing by some
scholars (e.g. Prince, 1981 contra Olson, 1977;
Fillmore and Kay's work on inferencing strategies
used in real-time reading and in choosing among
multiple choice answers is also relevant, cf.
Fillmore & Kay, 1980). Certainly the stereotype that
characterizes spoken languages as largely
dependent on what is present in the speech
situation has been exaggerated where school-age
children are concerned. In our own research, we
found that virtually all students in the 10-12 age
range are highly capable of formulating complex
narratives of personal experience about people and
events not present or evident in any way in the
speech situation.

Within the domain of the sentence, there has long
been evidence that cognitively and structurally
complex forms are used by speakers of all SES
levels, whether or not they conform to standard
written norms (cf. Labov, 1972).

In the case of the relative clause, as an example of
complex syntax, there is some evidence accruing
that it is sensitive to medium, so that it tends to be
used more often in written than in spoken
presentation of the same informational content (A.
Kroch, p.c. on a University of Pennsylvania project
comparing spoken and written versions of the same
text produced by undergraduate, largely working
and lower middle class, college students). This
connection between structures like the relative
clause and literacy indicates why it is highly valued
by some test designs.

Our findings show that there is a non-literate 
spoken base for the relative clause (among other 
types of complex syntax). For the most part, the
educational problem is not one of teaching the form
of the relative clause or even of transferring its form
from one language to another (for L1 Spanish
speakers), but rather of teaching the difference
between the uses of the relative clause (and other
syntactic features) in spoken and written language.
Toward this goal, it is essential to know the
difference between the vernacular uses in the 
community served and the uses in the standard 
language. 

As a final note, it is also important to recognize
whatever formal differences occur between written
and spoken language. As discussed above, the
vernacular forms of Spanish and English of the
community of speakers studied differ from the
standard forms of both languages in relativizing on
a prepositional object (as in (3) and (4) above). The
standard forms of Spanish and English are
congruent across these languages as a result of a
common origin in the same Latinate literary
tradition. The vernacular forms have evolved
through an independent spoken tradition. In the
case of English, the vernacular form is much older
than the standard form, and reflects a durable
tradition acquired by L1 Spanish speakers in L2
without transference from Spanish (cf. Jespersen,
1965 on the history of the different forms of the
relative clause in English). This indicates that
acquisition of these features of L2 is effectively
taking place outside of the academic context. The
impact of vernacular syntactic forms of English for
the bilinguals discussed here stands in striking
contrast to the lack of impact of syntactic forms
restricted to the standard literary language. The
standard English form of relativization on a
preposition is already found in second grade
textbooks, although we have seen that it is not used
in speech even at the sixth grade level (cf.
Language Skills Framework, 1980:382):



Possible difficulty in interpreting the standard
construction in reading on the part of such students
is not necessarily the result of bilingualism
(although the first impulse of the educator
insulated from the realities of vernacular speech is
often to draw that conclusion), but rather of the
acquisition of a nonstandard form of English shared
by most lower SES groups.

The consequences of our findings are:

1. Hypotheses about syntactic development
based on test speech exclusively, while
possibly symptomatic of literacy-related
problems, do not rest on a reliable data base as
indicators of language resources of bilingual
students of lower SES backgrounds.

2. Transference of language resources across
both languages and situations is a literacy-
related problem which must be distinguished
from developmental problems in oral second
language development.

3. Differences between standard and non-
standard syntax known to exist in many
monolingual English (and Spanish) communities
also exist in (at least) some lower SES bilingual
communities and must be carefully distinguished
from differences caused by direct bilingual
transference.

In separating the various sources of problems
involving language resources toward the goal of
literacy, it is necessary to develop a firm
informational base distinguishing both forms and
uses of syntactic structures in vernacular and
standard languages, no less for bilinguals than
for anyone else.



References



Brent-Palmer, C. "A Sociolinguistic Assessment of the
Notion 'Immigrant Semilingualism' from a Social
Conflict Perspective." Working Papers on
Bilingualism, 17:135-180. April, 1979.

Burt, M. & Dulay, H. Technical Manual for the Bilingual
Syntax Measure. New York: Harcourt, Brace and
Jovanovich, Inc., 1976.

Chiang, D.L. "Predictors of Relative Clause Production."
In S. Krashen & R. Scarcella, eds., Research in Second
Language Acquisition. Rowley, Mass.: Newbury
House, 142-145. 1980.

Cummins, J. "The Cross-Lingual Dimensions of
Language Proficiency: Implications for Bilingual
Education and the Optimal Age Issue." TESOL
Quarterly, 14:2, 175-187. 1980.

Cummins, J. "Age on Arrival and Immigrant Second
Language in Canada: A Reassessment." Applied
Linguistics, 11:2, 132-149. 1981.

Cummins, J. "The Effectiveness of Bilingual Education."
To appear in Focus, forthcoming.

De Avila, E.A. & Duncan, S.E. "Language Assessment
Scales." LAS I: Examiner's Manual. Corte Madera,
Ca.: Linguametrics Group, Inc., 1977.

Fillmore, C. & Kay, P. "Progress Report: Text Semantic
Analysis of Reading Comprehension Tests." UC
Berkeley Depts. of Linguistics and Anthropology.
Unpublished ms., 1980.



Guthrie, L. & Steffersen, M.S. "Effect of Situation on the
Verbalization of Black Inner-City Children." Paper
presented at the LSA Summer Meeting,
Albuquerque, NM. Aug. 3, 1980.

Heidelberger Sprachsforschung Projekt. "The
Acquisition of German Syntax by Foreign Workers."
In D. Sankoff, ed., Linguistic Variation: Models and
Methods. New York: Academic Press, 1978. 1-22.

Herbert, C.H. Basic Inventory of Natural Language
(BINL). Instructions Manual, Revised. San
Bernardino, CA: CHECpoint Systems, Inc., 1979.

Jespersen, O. A Modern English Grammar.
London: George Allen & Unwin, 1965.

Labov, W. "The Logic of Nonstandard English." In
Language in the Inner City: Studies in the Black
English Vernacular. Philadelphia: University of
Pennsylvania Press, 1972.

Language Skills Framework. Vol. 1: Oral Language:
Resources for Developing a Student Placement
System for Bilingual Programs. SWRL project for the
U.S. Office of Education. Los Alamitos. March, 1980.



Leap, W. "On the Implicit Politics of Educational
Research: Indian English and the 'Semilingualism
Conspiracy'." Unpublished ms., 1979.

Michaels, S. "'Sharing Time': Children's Narrative Styles
and Differential Access to Literacy." Language in
Society, 10:3, 423-442. 1981.

Michaels, S. & Kadar, L. "Children's Discourse Style and
Differential Access to Literacy." Paper presented at
the annual meeting of the American Association of
Anthropologists. December, 1981.

Moll, L.C. "The Microethnographic Study of Bilingual
Schooling." In R.V. Padilla, ed., Ethnoperspectives in
Bilingual Education Research, Vol. III: Bilingual
Education Technology. Ypsilanti: Eastern Michigan
University, 1981. 430-446.

Olson, D. "From Utterance to Text: The Bias of Language
in Speech and Writing." Harvard Educational
Review, 47, 257-281. 1977.

Padilla, R.V., ed. Ethnoperspectives in Bilingual
Education Research, Vol. III: Bilingual Education
Technology. Ypsilanti: Eastern Michigan University,
1981.

Prince, E. "Toward a Taxonomy of Given-New
Information." In P. Cole, ed., Radical Pragmatics. New
York: Academic Press, 1981. Pp. 223-255.



Skutnabb-Kangas, T. & Toukomaa, P. Teaching Migrant
Children's Mother Tongue and Learning the
Language of the Host Country in the Context of the
Sociocultural Situation of the Migrant Family. Dept.
of Sociology and Social Psychology, University of
Tampere, Finland, 1976.

Snow, C.E. & Hoefnagel-Hohle, M. "The Critical Period
for Language Acquisition: Evidence from Second
Language Learning." Child Development, 49,
1114-1128. 1978.

Wald, B. "Report on the Study of Limited Language
Proficiency." Working Paper Series. National Center
for Bilingual Research, 1980.

Wald, B. "Reaction to Troike's 'SCALP'." Presented at the
Language Proficiency Assessment Symposium,
Warrenton, Va. March, 1981.

Wald, B. "The Relation of Topic/Situation Sensitivity to
the Study of Language Proficiency." In R.V. Padilla,
ed., Ethnoperspectives in Bilingual Education
Research, Vol. III: Bilingual Education Technology.
Ypsilanti: Eastern Michigan University, 281-308.
1981a.



GRADE LEVEL AND ENGLISH ACQUISITION: 
AGE, YEAR OF BIRTH, OR SOMETHING ELSE? 



Gerald E. DeMauro 
New Jersey State Department of Education 



Abstract 

Recent analyses of achievement in New Jersey's 
categorical programs have shown that the amount that 
students gain, relative to test norms, is related to grade 
level. Younger students make the largest gains. The 
characteristic grade function seems to describe gains 
made by New Jersey bilingual education students on 
the Language Assessment Battery (Cumbo, O'Neill, 
Tilis and Weichun, 1976). However, on closer 
examination, differences in scoring gains reflect 
changes in test levels, rather than changes in grades. 
The phenomenon is related to changes in aptitude 
noted on recent intelligence tests. Younger children 
have been scoring higher on these tests in recent years 
(Herman, 1979). Perhaps cultural influences are causing
temporary verbal aptitude changes. Decreases in these
changes over time (Lazar and Darlington, 1982) may
coincide with declines in the rate of language
acquisition over grades. Other theories, including the
influence of acculturation, are presented as
possible alternate explanations of the grade effect.

Gains made in reading (Figure 1) and math (Figure 2)
scores by New Jersey's state compensatory education
and Title I students in recent years have been negatively
related to grade level (DeMauro, 1980, 1981a). Since
analyses are conducted in Normal Curve Equivalent
(NCE) scores, which gauge student performance
relative to test norms (Tallmadge, 1976), these results
show that younger program students make larger
improvements relative to grade peers.
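
As an illustration of why an NCE gain is a gain relative to the norm group, the sketch below converts national percentile ranks to NCEs using the conventional definition (a normalized score with mean 50 and a scaling constant of 21.06). The function names are hypothetical and introduced only for this example; the paper itself reports NCEs taken from published test norms and does not show a conversion routine.

```python
from statistics import NormalDist

# Conventional NCE scaling constant: chosen so that NCEs of 1, 50 and 99
# line up with percentile ranks 1, 50 and 99.
NCE_SCALE = 21.06


def percentile_to_nce(percentile_rank: float) -> float:
    """Convert a percentile rank (strictly between 0 and 100) to an NCE."""
    if not 0.0 < percentile_rank < 100.0:
        raise ValueError("percentile rank must lie strictly between 0 and 100")
    # z-score of the percentile under the standard normal distribution
    z = NormalDist().inv_cdf(percentile_rank / 100.0)
    return 50.0 + NCE_SCALE * z


def nce_gain(pre_percentile: float, post_percentile: float) -> float:
    """Pretest-to-posttest gain in NCE units (hypothetical helper)."""
    return percentile_to_nce(post_percentile) - percentile_to_nce(pre_percentile)
```

Under this definition a student who holds the same percentile rank from pretest to posttest has an NCE gain of zero, so a positive gain means the student moved up relative to grade peers in the norming sample.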

Echternacht (1980) reports that early attempts were
made to standardize Title I program effects by
transforming mean student scale scores from 23 states
on fall, 1977 pretests and spring, 1978 posttests to
NCEs. The results, termed "normal growth," agree with
New Jersey's findings of greatest gains in the lowest
grades.¹

Similar results are common in bilingual education.
Douglas and Johnson (1981) claim that reading and
math gains of first and second grade Title VII students
surpass what could be expected from the test norms.
The effect has some generality over academic years and
treatment populations.

In a 1979-80 study of English acquisition by New Jersey
bilingual students (DeMauro, 1981b), the relationship of
gains in NCEs to grade level was similar except in
grades 1 and 7. The scores in Figures 1 and 2 represent
means of average grade level gains reported to the New
Jersey Department of Education by local districts. The
scores in Figure 3 are student mean gains.

The data in Figure 3 are derived from the fall, 1979 and
spring, 1980 administrations of the Language
Assessment Battery* (Cumbo, O'Neill, Tilis and
Weichun, 1976) to 729 bilingual education students.
The sample, from grades 1-12, was stratified to
represent New Jersey's bilingual education population
in educational environment (pullout, center approach,
etc.), socioeconomic status, region of the state, and
years of program experience.

The English skills acquisition data are the focus of the
discussion because:

a) The single instrument requires no assumptions
concerning the equivalence of NCE scores from
various tests;

b) The sample represents New Jersey's bilingual
education population;

c) The sample stratification permits analyses of
meaningful effects, such as program
experience.

The current paper examines grade level (and age) as a
determinant of English skills acquisition by bilingual
education students. It discusses the general grade
effect evident when standardized scores (such as
NCEs) are used, as well as other grade effects specific to
the acquisition of English as a second language. Finally,
several cognitive and noncognitive explanations for the
observed effects are discussed. First, the effect is
defined as accurately as possible through covariance.

Regression Scores

To reduce the large error component of difference
scores (Thorndike and Hagen, 1969), the pretest was
covaried from the posttest of the New Jersey English
acquisition data and the adjusted means were plotted
by grade and years of program experience (Figure 4).



¹The author recommends caution as program selection was based on pretest scores, and some gain may be attributed to regression to the mean.



*This instrument has four subtests: listening, speaking, reading and writing. The scores used in analyses were NCEs derived from New York City's
total test norms.

Figure 1

Mean Math Gains (in NCEs) of
New Jersey Title I and State Compensatory
Education Students, 1978-1980, By Grade

Figure 3

Mean Gains (in NCEs) of
a Representative Sample of
New Jersey Bilingual Education Students
on the LAB, 1979-1980

Figure 4

Adjusted Posttest Means (in NCEs) of a
Representative Sample of New Jersey
Bilingual Education Students on
the LAB, 1979-1980

The LAB is divided into three levels, spanning grades
K-2, 3-6, and 7-12. The covariance analysis is
conducted within each level. Main effects are program
experience and grade, the dependent variable is the
posttest and the covariate is the pretest.
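
The adjustment step can be sketched as follows. This is a minimal illustration, assuming a single grouping factor and a single covariate with a pooled within-group regression slope; the analysis reported here crossed two factors (grade and program experience), which this sketch does not attempt to reproduce. The function name and the data layout are hypothetical.

```python
from statistics import mean


def adjusted_posttest_means(groups):
    """Covariance-adjusted posttest means for one grouping factor.

    `groups` maps a group label to a list of (pretest, posttest) pairs.
    Each group's posttest mean is shifted to the value expected if the
    group's pretest mean had equalled the grand pretest mean.
    """
    all_pre = [pre for pairs in groups.values() for pre, _ in pairs]
    grand_pre_mean = mean(all_pre)

    sum_xx = 0.0  # pooled within-group sum of squares of the pretest
    sum_xy = 0.0  # pooled within-group cross-products, pretest by posttest
    group_means = {}
    for label, pairs in groups.items():
        pre_mean = mean(pre for pre, _ in pairs)
        post_mean = mean(post for _, post in pairs)
        group_means[label] = (pre_mean, post_mean)
        for pre, post in pairs:
            sum_xx += (pre - pre_mean) ** 2
            sum_xy += (pre - pre_mean) * (post - post_mean)

    if sum_xx == 0:
        raise ValueError("pretest scores do not vary within groups")
    slope = sum_xy / sum_xx  # pooled within-group slope of posttest on pretest

    return {
        label: post_mean - slope * (pre_mean - grand_pre_mean)
        for label, (pre_mean, post_mean) in group_means.items()
    }
```

Because the adjustment removes the part of each posttest mean that is predictable from the pretest, groups that started lower are not credited with gain merely for having more room to move, which is the concern with raw difference scores noted above.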

Significant effects are only found in Level 1. The grade
effect (F(1,257) = 5.09, p < .025) shows that second
graders outscored first graders. Students with more
program experience outscored those with less (no first
graders had three years of experience) (F(3,257) = 3.56,
p < .015). The grade and program experience
interaction (F(3,257) = 3.93, p < .048) shows that more
experienced students scored higher in second grade,
but lower in first.

As shown in Figure 4, the characteristic grade effect is
better described in these data as differences among test
levels and not grades within levels. One major
assumption underlying the use of English norms is that
tests that are more difficult for monolingual English
students are proportionally more difficult for
limited-English-proficient (LEP) students. Therefore, a
harder level of a test should yield the same distribution
of NCE scores because the test is harder for both the
English norming sample and the LEP target population.
In fact, a more difficult test may depress LEP student
performance much more than it depresses the
performance of the monolingual English norms groups.
This would explain the relatively poorer performance of
LEP students on different levels of a test that is normed
on monolingual English samples of equivalent abilities.
Although the LAB standardization procedures
attempted to control for student abilities across levels
(see the technical manual), these levels may have had
varying difficulty for LEP students.

Alternately, the differences in scoring across levels may
reflect real differences in language acquisition that
coincide with test levels. Younger children may learn
English more quickly. This argument is consistent with
the fact that grade has been a significant effect in all of
the achievement analyses in New Jersey and is not an
artifact of a particular test. Moreover, analyses of
English language achievement on various tests
administered by New Jersey local bilingual education
programs show that grade is significant both as a main
effect (F(3,70) = 10.241, p < .001) and in interaction
with pretest to posttest gains (F(3,71) = 7.028, p < .001)
(Table 1).



Critical Period Hypothesis 

Perhaps Figure 3 may be explained by a critical period
phenomenon. Younger children appear to approach
English skill levels of monolingual grade peers more
rapidly.

When English gains are examined without reference to
grade norms, the superiority of younger children
vanishes. In their examination of the critical period
hypothesis, Snow and Hoefnagel-Hohle (1978) report
that older children acquire Dutch morphology and syntax
and listening comprehension more rapidly. Contrary to the
hypothesis, 12-15 year olds learned Dutch most rapidly
and 3-5 year olds least rapidly.

After reviewing language acquisition literature,
Krashen, Long and Scarcella (1979) conclude that
adults and older children acquire syntax and
morphology faster, at least in the first year, than
younger children. However, people who are naturally
exposed to second languages as children become more
proficient than those who are first exposed as adults.



Table 1 

Achievement Test Gains in NCEs of 
Bilingual Programs in New Jersey 
During the 1980-81 Academic Year 
in Reports to the State 



Grade                  Pretest    Posttest
Cluster*    Number     Mean       Mean        Mean Gain

1-3         21         27.9       52.2        24.3
4-6         16         28.0       42.0        14.0
7-9         16         24.4       34.2        10.2
10-12       18         10.4       15.8         5.4

*Because the unit of analysis is program means, the number of
available means is small, and grades are clustered to increase stability.



These studies do not conflict with the achievement data 
findings. The achievement data do not show that 
younger children outperform older children. Rather, 
they show that today's younger children acquire
English more rapidly than the younger children in the 
test norms. Similar results have been noted when 
comparing performances in IQ tests between today's 
children and age peers from years ago. 

Herman (1979) notes that the same raw score yields a
lower IQ on the WISC (Wechsler, 1949) than on the
WISC-R (Wechsler, 1974). He theorizes that these
differences may either reflect differences in
standardization samples or true increases in abilities
that result from improvements in underlying capacities,
test-wiseness or cultural exposure to toys, books or
television.

Herman's data on the Stanford-Binet (Terman and Merrill, 1972)
show that interedition IQ differences are more
pronounced for younger children. This implies a
curvilinear relationship between measured abilities and
year of birth.

As an illustration, suppose that the improvements in
measured abilities were predicted by the year of the
child's birth. Six-year-olds (first graders) who took the
test in 1979 would average three score units higher
than six-year-olds who took the test in 1976 under a
simple linear model (growth = birth year + constant).
The same three units would separate children of any
age.*

*Actually, the same three units times the regression slope.



Under the simplest quadratic model, e.g.
$\text{growth} = b(\text{birth year})^2 + c$, the differences for
six-year-olds would follow:
$[b(y+3)^2 + c] - [b(y)^2 + c] = 6by + 9b$, where b
is the slope of year on growth, y is a linear function of
the birth year and c is a constant. For 11-year-olds, this
difference would be smaller, because they would have
been born five years earlier, and y would be five units
less.
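
The contrast between the two models can be checked numerically. The sketch below is only an illustration of the arithmetic: the function names, the default slope values and the coding of birth year as a small number y are all assumptions made for the example, not quantities estimated in this paper.

```python
def linear_growth(y, slope=1.0, constant=0.0):
    """Linear model: growth = slope * y + constant, with y the coded birth year."""
    return slope * y + constant


def quadratic_growth(y, b=1.0, c=0.0):
    """Simplest quadratic model: growth = b * y**2 + c."""
    return b * y ** 2 + c


def cohort_gap(model, y, years_apart=3, **params):
    """Difference in predicted growth between cohorts born `years_apart` years apart."""
    return model(y + years_apart, **params) - model(y, **params)


if __name__ == "__main__":
    # Linear model: the gap is the same for every birth cohort.
    print(cohort_gap(linear_growth, 10), cohort_gap(linear_growth, 5))
    # Quadratic model: the gap equals 6*b*y + 9*b, so it shrinks by 30*b
    # for children born five years earlier (y five units smaller).
    print(cohort_gap(quadratic_growth, 10, b=0.5), cohort_gap(quadratic_growth, 5, b=0.5))
```

Under the linear model the three-year gap is constant across ages, whereas under the quadratic model it depends on y, which is what allows the year-of-birth effect to be larger for younger children.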

IQ differences between the WISC and WISC-R are
greater on the performance scale than on the verbal
scale and interact with age (Doppelt and Kaufman,
1977). Below age 11, IQs average about four points
lower on the verbal scale and about seven points lower
on the performance scale. Above age 11, the
performance scale still averaged about six points
difference, while there were virtually no differences on
the verbal scale.

Herman (1979) argues that today's children are
"brighter." In rates of language acquisition, the
differences between contemporary children and age
peers of a few years ago are larger for younger children.
This is consistent with the finding that differences in
verbal IQ fall off after age 11 (interestingly, at about
sixth grade, the last grade in LAB Level 2).

Lazar and Darlington (1982) similarly report that IQ
gains made by children who participate in early school
intervention projects are not permanent. They are
largest after one or two years. The authors claim that IQ
and achievement tests are related indices of developed
abilities. The early intervention promotes cognitive and
noncognitive skills, such as attentiveness and
perseverance, which influence IQ and school
achievement.

The aptitude gains which Herman cites may result from 
cultural influences (books, toys, television, etc.) which
act like the early school projects. If so, they may well 
drop off after a certain period. Tests measuring these 
aptitudes would show large (and perhaps nonlinear) 
grade effects. Interestingly, Lazar and Darlington (1982) 
report such achievement test gains by program 
participants, but attribute these to methodological 
artifacts. 

Troike (1981) believes that IQ and achievement scores
may be largely determined by acculturation to middle
class America. If so, the patterns described by Title I
and bilingual education students may be accounted for 
by a cultural interference hypothesis. Younger children 
make larger cultural gains in the same time period 
simply because they have less experience with (and 
less interference from) other cultural values. 

This is a simplistic elaboration of Troike's hypothesis. In 
fact, other factors may be operant in acculturation of 
various grade levels. For example, the hypothesis does 
not account for the significant grade effect in Level 1. In 
the original study analysis (repeated measures),
pretest-posttest growth was significantly larger for
second graders (F(1,247) = 11.72, p < .001), when by
hypothesis, it should have been smaller.

The First Grade Anomaly

The linguistic interdependence hypothesis (Cummins,
1979) states that a child's first-language competence at
the time of intensive exposure to the second language
predicts second-language acquisition. Cummins (1981)
also believes that skills in both languages are predicted
by a common underlying proficiency. The WISC-R
would certainly be sensitive to such a construct.
Moreover, this underlying ability may well contribute to
achievement in math and reading.

Cummins' two hypotheses are consonant with the data.
First graders with more program experience actually
score lower than those with less program experience.
More experienced students may have been prematurely
exposed to intensive English. By second grade, these
differences favor students with more experience.
Perhaps any growth-inhibiting effects of premature
English exposure have been overcome (third-year
second graders scored highest) because the bilingual
program and maturation sufficiently strengthen the
native-language skills by the third year of the program.

The interdependence of language skills explains the 
nonlinear age (year of birth) function. Verbal skills, 
which are more developed among contemporary
children, stimulate cognitive growth which aids further
verbal skills development (Cummins, 1979).

Summary 

The current paper easily raises as many questions as it
answers. Much of the scoring variance on language
acquisition tests is determined by program effects.
However, there are obviously non-program effects 
which contribute to this variance as well. 

Primarily, there is a general grade effect, in which the
amount of gain relative to test norms is typically very
large in the early grades. Younger children are not 
outscoring older children, but are scoring higher 
compared to grade peers from years ago than are older 
children. This effect is, at least in some part, related to 
IQ increases among today's children, particularly in 
verbal scales. 

The relationship between grade and score improvement
appears to be nonlinear. However, this could not be
shown with the current data, in which there was no
effect for grade within any level.

The large differences in test scoring among test levels
probably have several components. Since this instrument
is not vertically equated, one cannot assess whether the
difficulty of the levels is different for bilingual
populations. However, one must question whether
variations in test difficulty have a larger effect on
depressing the performance of LEP students than on
monolingual English students.

Recent test literature and achievement score analyses
suggest that the grade effect is not so easily explained.
In fact, the scores are so similar to findings in other
areas that improved verbal abilities of today's children
may well explain various findings.

Finally, longitudinal research shows that interventions
which raise aptitudes may not be permanent. Perhaps
factors such as improvements in cultural stimulation
available to contemporary children account for
improved performance on IQ tests and achievement
tests up to the sixth grade. Beyond sixth grade, the
flatter effect of improvements over grades is attributed
to the constant influence of the bilingual program.


SYMBOLIC FORMS BETWEEN YOUTHFUL SAME-SEXED, CLOSEST FRIENDS



S. Kirson Weinberg, Ph.D.
Sociological and Psychological Associates



Abstract 

In this inquiry, we analyze the relationship between the
rudimentary linguistic forms and same-sexed closest
friendship among college youths. Our concern is to
determine the types of verbal and gestural symbols that
arise and characterize this mode of socially intimate
relations. The forms of address between intimate equals
are differentiated from those between status unequals
and between socially distant equals; the specific types
of personal address between closest friends and their
meaning for intimacy are presented. The situations and
circumstances which influence the spontaneous
formation of idiomatic verbal forms and gestures and
the categories of these improvised and expressive
symbols are described and their effects upon the
intimate relationship elaborated. The generic
hypothesis between socially intimate relations and
cognition and modes of speech is characterized and
distinguished from the theory of established culture as
heritage and its cumulative property as it affects
language and thought.

SYMBOLIC FORMS BETWEEN SAME-SEXED,
CLOSEST FRIENDS 

Symbolic expressions of special words, phrases and
gestures are aspects of cognitive communication
between same-sexed, closest friends. Since social
relations are the matrices from which symbolic forms as
meaningful consensus emerge, the modes of verbal and
gestural expressions within personal relations would
differ from the verbal and gestural forms in impersonal
relations (Mead, 1934, 1938). Consistent with this
perspective, the social styles of speech, the turns of
phrases, the "slangy" terminologies arise from the ways
social relations arrange themselves (Sapir, 1949). This
approach is applied to the study of the modes of
linguistic expressions of same-sexed, closest
friendships among single undergraduate college
students in the urban community. Our specific concerns
include 1) the variations between the elaborate
language and argot systems in specialized subcultures
and the sparse and simple verbal forms in closest
personal relations; 2) the modes of address between
persons of unequal status, and those of equal status as
well as between social intimates; 3) the situations and
circumstances as contexts within which words, phrases
and gestures are incorporated into their peculiar stream
of communication; 4) the types of words, phrases and
gestures that characterize the social intimacy between
closest friends; and 5) the effects of these modes of
expressive language upon these relationships.



Rita M. Weinberg, Ph.D. 
Department of Psychology 
National College of Education

II

The subjects consisted of single, undergraduate
students from four urban colleges and universities. The
subjects were urban-reared, 98 percent native
American with some foreign-born who had immigrated
during childhood, single or never married,
predominantly white (97 percent), and in the broad
middle class of white collar and skilled laboring or
artisan fathers. The age interval of 18 through 25 was
used for these integral reasons: First, they were
accessible. Second, they were in a time span when they
were disengaged or disengaging from the family and
were not completely committed to a mate. As
individuals, they are reaching out for peers, so that the
same-sexed friendships are perhaps the closest of any
that they have formed or will form in the future. By
disengaging from the family, their friendships are
individually rather than family motivated and activated.
Since peer adolescent relationships generally share a
pervasive subculture, their modes of interaction also
share a distinctive language at variance from that of the
adult subculture. This adolescent subculture with its
emphasis on informality provides the basis for the
modes of speech between closest friends who tend to
incorporate selected words, phrases and modes of
speech to express their social intimacy.

Since this particular inquiry is a phase of a larger study, 
the information analyzed was drawn from 210 
semi-guided personal documents. The subjects who
composed these documents were asked to describe the
development and crucial features of their friendship.
The aim of the analysis of these documents was to
determine the degree of social intimacy between the
closest friends. From these personal documents we
constructed a series of questionnaires and scales for
measuring specific traits of social intimacy. One
questionnaire concerning the linguistic expressions of
social intimacy between same-sexed, closest friends
was administered to 51 subjects.

Closest friendships, as we found, vary considerably in
degree of social intimacy. Hence, their modes of speech
may also vary. The same-sexed, closest friends
comprise the peers selected by the subjects because
only each subject can designate his or her closest
friend. The degree of social closeness between these
closest friends then became the objective of inquiry,
and the modes of speech between closest friends
would reflect the variations in their social intimacy. This
paper, however, is to present the linguistic expressions
of the closest friends who are indeed intimate because
those closest friends, who are somewhat distant
despite pretensions of a close relationship, have
symbolic expressions similar to others in their age and
sex categories.

An interest in the idiomatic verbal symbols between
friendly intimates as a phase of linguistic inquiry may
be regarded as an extension of the study of specialized
languages and argots of subcultures including ethnic,
deviant and occupational types. The languages of
foreign groups as ethnic units settling in and becoming
transplanted into the American pluralistic urban
community were noted as integral and even central
parts of their specialized subcultures. The basic feature
of these linguistic systems is their cumulative character
of an established tradition and heritage (Thomas and
Znaniecki, 1921; Wirth, 1927). The techniques used in
studying ethnic subcultures and their languages were
extended to the study of deviant and occupational
argots. Thus elaborated argots were synthesized for
varied criminal groups such as the pickpocket, the
professional thief, the career criminal and the drug
addict (Maurer, 1936, 1938, 1955; Sutherland, 1937;
Lindesmith, 1947; Weinberg, 1942). The argot as a
specialized form of linguistic system was noted too
among doctors, lawyers, and other groups (Becker,
1964).

The distinctive trait of these argots is their cumulative
property, in that the language grows and changes
through a process of accretion of "new generations" of
recruits as these become incorporated into specialized
units, and they in turn add words to the specialized
vocabulary, as these terms are transmitted further to
recruits and members. Verbal accumulation is lacking in
the sparse symbolic forms generated between
same-sexed, closest friends whose particular modes of
expression arise de novo from the closeness of their
relationship and dissipate with its dissolution or
disruption.

III

At the outset, it is evident that in several cultures, e.g.,
French, German, Japanese, Lower English, the modes of
address for social intimates vary from those in unequal
status positions and in some degree from equals
generally. Status equals, whether intimate or distant,
address each other in French by "Tu," while superior
and inferior address each other by "Tu" and "Vous,"
respectively. A superior is addressed by a title, e.g., Mr.,
Mrs. or Miss, and a last name (TLN), while the
subordinate is addressed by a first name (FN) (Brown,
1965; Silverberg, 1949; Fay, 1920). Equals address each
other at least by their first names, but intimate equals
may use abbreviated first names as well as nicknames.
In Japanese the pronoun for you, "kimi," is the mode of
address for inferiors but also between equals, while
"anata" is the term for you to superiors; and in German
"Sie," in contrast to "du," is the address to superiors
and to strangers. Brown has regarded these modes of
address as invariant norms in linguistic behavior (See
Chart I).

The change of address between dyads from Vous to Tu
represents a change from unequals to equals. In
Germany this change in mode of address is done by a
rite of passage, a brief informal ceremony called the
"Bruderschaft." One person, who waits for a congenial
mood, probably over a glass of wine, suggests: "Why
don't we use Tu to address each other?" The response is
favorable and means a shift in image that is reciprocal.
However, one necessarily non-reciprocal phase of the
relationship exists, specifically that one of the persons
must be explicit in verbalizing the suggestion. But in
American society, these German modes of address do
not exist. Indeed, it is characterized by informality
which is especially pervasive in the adolescent
subsociety.

Among our subjects, 55 percent addressed their closest
friends by full or abbreviated first names, while 37.3
percent addressed their closest friends by a variation of
a first name or a last name, e.g., Mac for Mackenzie,
or by a nickname. Only 3 percent addressed their
friends by their last names and one percent used the
formal title of Mr. or Miss. Most of those who addressed
or were addressed by their closest friends with
nicknames had known them from childhood. Since the
preponderant 85 percent of the subjects had lost their
childhood friends, a minority of the subjects, as we have
indicated, used a variation of the first name or a
nickname. Seemingly, those in the upper
socioeconomic stratum are more likely to sustain their
friendships from childhood: they call each other in
private by their nicknames or twisted first names.

Chart I



Forms of Address among Superiors and Inferiors, Intimate Equals and
Non-Intimate Equals

[Diagram not reproduced. Each panel plots forms of address on a
superior/inferior axis and an intimate equals/not intimate equals axis.
Legible labels: TLN, FN, anata, kimi.]

a. Examples of the invariant norm

b. General form of the invariant norm

From Roger Brown, Social Psychology (New York: The Free Press, 1965), p. 93.

Another mode of address among equals and especially
between intimates is the diminutive, characterized in
the suffix by an added "y" or "ey." For example, Joseph
is abbreviated to Joe and in the diminutive becomes
"Joey." Gertrude is abbreviated to Gert and in the
diminutive becomes "Gertie." The subjects
intermittently added the diminutive to the abbreviated first
name, but few used the diminutive continually, except
for certain names, such as Jerry for Gerald or Jerome.
The subjects felt that the diminutive is more
appropriate for other socially intimate relations,
particularly between parent and child. The subjects as
adolescents and youths seemingly wanted to grow out
of their childhood role, "the kid stuff." Hence they
regarded a diminutive reference as not desirable
continually, although occasional references were
acceptable, even desirable. Another influence in the use
of the diminutive is the recourse of some politicians to
accept the informal diminutive of their first names.
President Carter insisted upon the use of the name
"Jimmy," rather than Jim or James. Governor Brown has
acquired the name "Jerry," while former Vice-President
Nelson Rockefeller was called "Rocky." But it should be
emphasized that in daily interaction with subordinates
and formal associates the politician is not called by his
diminutive first name any more than President
Eisenhower in his daily relations was called "Ike."

Does the mode of address affect the self-conception of
the friend in the relationship? The reciprocity of an
informal role-identity of friendship by the manner of
address indicates the closeness of the relationship and,
at times, the manner of affection from and for the friend.
An abbreviated first name may create the opening of an
informal relationship and also a feeling of potential
approval and acceptance. Although the diminutive of a
first name is used infrequently and for some friends not
at all, its use pertinent to a given mode of affection
indicates a reaction of closeness and emotional
warmth. Parenthetically, between nonintimates, it can
be used and interpreted as a sign of familiarity and
depreciation. None of the subjects used the diminutive
first name in this latter manner.



We have determined that about 70 percent of the
subjects are socially intimate with their closest friends.
About 23 percent were informal, but not socially
intimate with their closest friends, and about 7 percent
seemingly lacked same-sexed, closest friends
(Weinberg, 1970). Among the present group of 51
subjects, all of whom claimed to have closest friends,
45.1 percent and 35.3 percent claimed that their modes
of speech differed "markedly" and "slightly,"
respectively, from those with same-sexed peers
generally; 19.6 percent of the subjects averred that
their mode of speech with their closest friends did not
differ from that with other same-sexed peers.

Table I

Mode of Difference in Speech
between Same-Sexed, Closest Friends
and between Others of Same Age and Sex

Mode of Difference               Percent (N = 51)

Markedly Different                     45.1
Slightly Different                     35.3
Not Discernibly Different              19.6

Total                                 100.0



Since continuous socially intimate relations between
closest friends intensify mutual confidence and trust so
that more personal secrets and private reactions are
disclosed, they tend to develop an "in" frame of
reference. As a consequence, their mode of
communication becomes condensed so that a word or a
phrase or, at times, even a gesture may connote an
elaborate episode of experiences. This kind of talk also
excludes others as "outsiders" from understanding the
communication. This type of close friendship and a
consequent mode of communication is described in the
following statement: "Our relationship during the major
part of our friendship was somehow characterized by a
mutual exclusion of other friends. I spent most of my
spare time with Diane and participated in few activities
or groups without her. Besides Diane, I had no other
close friends, though I knew some Girl Scouts. But those
friends did not know Diane, who was not a Scout! So,
most of the time I spent with Diane was
unaccompanied by others. I remember sharing secrets
with her which we pledged never to repeat to others.
Since neither of us had other close friends, it was very
likely that we confided to a great extent. We devised a
code by which we could relay messages through the
classroom. The code was part of our secrets which was
known to no one but us. Because of the way we acted,
we recognized each other as best friends and were
identified as best friends by all our school mates."

Since closest friends are informal, they are also
spontaneous with each other and careless of their mode
of speech and sentence structure or syntax.

"I think there is complete lack of formality between 
Edith and I. When I talk about Edith I usually call her by 
her first name, but when I talk to her, I usually call her 
Edie. She calls me Maureen or Maur.

"I certainly talk freely in just about everything, and I
don't think that either of us hesitates to express
ourselves better, or more formally. We say what we
want to say when we want to say it. One reason I feel is
the cause of our lack of sentence structure and very
frequent use of symbols is the fact that Edith's parents
are foreign-born and Edith is not very good in English.
She gets her sentences mixed up sometimes, or
sometimes she uses the wrong word to express herself.
I do not hesitate to laugh when she expresses
something in a funny way. I do not know how Edie takes
my joking around like this, but I feel that others who do
not understand her [may] even be
worse than I, and I feel she should trust in a friend.
What I am trying to say is that when I laugh at her, I am
not laughing at her but at what she says. Sometimes
when I make errors in my speech, I kind of compare her
errors with mine. In this way, I am hoping that she will
not feel bad when she makes these errors in front of
strangers or those she wishes to make a good
impression upon."

Since spontaneous and expressive speech emerges
from close friendship, the mode of conversation may
become idiomatic on this level. These verbal and
gestural forms arise during circumstances of relaxation,
such as during and after parties or dances, during a stay
at the friend's home, 31.4 percent; on weekends or
when up late, 35.3 percent. The friends slough their
speech, relax and regress. They resort to silly behavior,
as the following instance illustrates.


Table II

Circumstances Stimulating Spontaneous
and Informal Speech between
Same-Sexed, Closest Friends

Circumstances                            Percent (N = 51)

Before, during or after Party,
  Dance or Celebration                         31.4
Staying Up Late, At Friend's Home,
  Weekend Meeting                              35.3
Being "Together a Lot"                         19.6
After Argument or Trouble                       7.8
No Response                                     5.9

Total                                         100.0

"Whenever possible one of us would spend the
weekend at the other's home. We followed a pattern of
activities each time. We ate, danced to the latest
records, set each other's hair, then ate some more and
read Mad comic books all hours of the night and started
using gushy words. Then we would start laughing
ourselves sick, turn off the light and laugh some more.
Some of the time we were in absolute hysterics over
absolutely nothing. Our parents or one of them would
then tell us to stop acting like kids. We would then try
to settle down and go to sleep."

In another situation, the movies, the following two 
friends reacted in hilarity and generated a word of 
singular meaning to the two friends.

"Mary and I have always had words and names for each 
other. I recall one typical experience that led to new
words. Once we went to the movies and saw a jungle
picture; the story was about a man's struggle against
red ants, or marabunta. For some reason we thought the
situation was hilarious. We came away from the movie
roaring with laughter and never forgot the word. For
years afterward, one of us would interject the word
"marabunta" into serious conversation and it would
provoke laughter. People around us were baffled by our
behavior and wondered what we were laughing about."

Thus the more hilarious and relaxed the circumstances,
the more apt are the friends to select or to distort words
about which to laugh [...] idiomatic meaning between
the friends.

Another characteristic is the manner of expression
between closest friends that occurs during relaxed
experiences. Words may be slurred and syntax may change
to incoherence. One subject admittedly omitted the
"r's" from words so that "better" became "betta," "harder"
became "harda," "bother" became "botha," and this
change was known by the other friend as an acceptable
vocal mannerism.

This spontaneous speech leads to spontaneous
distortion of words or to syntax incoherence, as the
following characterization attests.

"We speak incoherently, in a sort of shorthand that
others have a hard time following. Neither of us are very
organized in speech; and through our knowing each
other we have developed patterns of speech which we
use mainly when we are alone with each other."

Closest friends who are socially intimate differ in their
speech with each other from their mode of speech with
others. As can be seen from Table III, the subjects claim
that they more frequently use slang, swear and employ
pet phrases. In addition, those persons in bilingual
families incorporate these bilingual words into their
dialogue. Some have devised new words and even
phrases.

Table III

Modes of Speech Used by Subjects
More Frequently with Closest
Same-Sexed Friends Than
With Peers of the Same Age and Sex

Mode of Speech                           Percent (N=73)*

Slang Expressions                              24.7
Swear Words                                    20.5
Pet Phrases                                    19.2
Foreign or Bilingual Words
  or Phrases                                   12.3
Newly Devised Words or Phrases                  9.6
Other:
  "Open-up" More                                4.1
  Sarcastic and Teasing                         4.1
Seemingly No Difference                         5.5

Total                                         100.0

*The number exceeds 51 because the subjects used
more than one "mode of speech."

In addition to the words and pet phrases that some
closest friends incorporate into their dialogue, they also
may adopt certain physical gestures which may have a
specific or even a unique or idiomatic meaning for
them. The specific meaning of a given gesture may be
similar to the interpretation which others in the
adolescent culture express. The types and frequencies
of the gestures indicate that innovative and original
gestures tend to be sparse between closest friends. By
far, most gestures are through the eyes, 27.1 percent, and
fingers, 28.8 percent, with the body generally, 18.6
percent; the mouth, 13.6 percent; and the hand, 11.9
percent (Table IV).

Table IV

Types of Gestures Used in the
Interaction between Same-Sexed,
Closest Friends

Type of Gesture                                   Percent (N=59)*

Fingers                                                 28.8
  Insert finger in mouth (disapproval)
  Number of fingers for time and
    place of meeting
Mouth                                                   13.6
  Speak inaudibly with slow mouth movements
  Quick, short smile (Bad); Prolonged smile (Good)
Eyes                                                    27.1
  Roll eyes (Confused)
  Blink eyes (Approval)
  Shift eyes from side to side (Doubt)
Hand                                                    11.9
  Insert fist under chin (disapproval)
  Place hand on chest
Body Generally                                          18.6
  Bow to each other Japanese style
    (No specific meaning)
  Hug friend (general approval)
  Pat on the back (no "lie")

Total                                                  100.0

*Since subjects used more than one gesture, the
total is more than the number of subjects, 51. Six of
the subjects indicated that they used no particular
gestures that have a special meaning for them.

VI

We have pointed out that social intimacy between
same-sexed, closest friends tends to influence a mode
of speech, consisting of words, word enunciation and
gestures, that serves to sustain and reinforce the
intimate relationship. On the other hand, the recourse to
critical, hostile words or phrases serves to cool and
recede or even disrupt the relationship.

On a positive level, informal, spontaneous symbolic
interaction pertinent to an intimate friendship
relationship may contribute to such personal
gratification that the expressive process makes the
mode of speech and the substantive symbols
end-values. By contrast, the impersonal, instrumental or
utilitarian relationships are means to ulterior ends so
that the modes of speech and words used are designed
for effect to achieve a different objective.

When the mode of speech, modes of address and the 
words used become formal, hostile or depreciating, 
then the relationship may cool and recede to one of 
distance. But when the modes of address and words 
used emotionally support and enhance the self-esteem, 
the relationship may become sustained in its closeness. 

The verbal symbols as "idioms of social intimacy" imply
an in-group reaction as well as attitudes of solidarity
and unity. The friends as interactants realize that they
alone have the meaning of these terms so that others
are "outsiders." The more frequent the use of idiomatic
words and gestural signals, the more pervasively the
close friends represent an "in" social unit (Slater, 1963;
Newcomb, 1961).

The spontaneous recourse to slang, slurred and swear
words characterizes verbal forms of informality and ease
of presence as well as expressive release and catharsis
in some situations. Hence these substantive aspects of
communication indicate the interpersonal moods of
trust and relaxation between the friends. These types of
expression in interaction make the relationship
satisfying and pleasurable and tend to intensify the
bond between the friends.

Conclusions 

The demonstrated consistency between socially
intimate friendships of the same sex and the modes of
emerging symbolic forms as an object of inquiry is a
discontinuous extension of the study between culture,
thought and language. This approach to linguistic forms
is unlike the conception of language as an aspect of a
growing, accruing cultural heritage which is
transmitted to incoming novices and reinforced among
those of the same generation. Instead this relational
approach to the formation and retention of verbal and
gestural forms emphasizes that these arise from the
modes of intimate interaction, lack any cumulative
characteristics and dissipate with the receding
closeness of the friendship. The cultural approach to
cognition and language is consistent with the
Durkheimian approach and was developed largely by Sapir
(1949) and Whorf (1956). The relational approach is
consistent with the Meadian view (Durkheim, 1951;
Schneider and Homans, 1955; Barnow, 1973; Mead,
1934; Weinberg, 1952; Gibson, 1969; Horowitz, 1970;
Neisser, 1976).

The modes of intimate social interaction are the bases
for distinct if fragmentary types of idiomatic linguistic
forms and gestures that arise from and reinforce this
type of relationship. We have presented the
circumstances and types of verbal forms that arise in
this interactional context. Although Whorf has
emphasized that the language as an integral aspect of
the culture influences the modes of thought, we have
found in socially intimate relations the improvised and
spontaneous terms that are spoken are influenced by
the interpersonal feeling, tone, and uncritical expressive
thinking. Thus the words and gestures, as these are
incorporated into their substantive interaction, are
consequents of moods, especially euphoric ones, and of
permissive, even regressive thinking.

THE INFLUENCE OF PUBLIC POLICY ON LANGUAGE
ASSESSMENT OF BILINGUAL STUDENTS

Rudolph C. Troike
University of Illinois at Urbana-Champaign 



From a policy-oriented point of view, it is interesting to
see how concern in recent years for language
assessment in bilingual education has been primarily
the consequence of legislative and judicial decisions,
rather than of purely educational decisions. If we
recognize that all educational policy is fundamentally
political (indeed, the word política in Spanish
comprehends both policy and politics), this situation
should perhaps not be surprising. Nevertheless, it is true
that few other areas of the educational arena have been
so directly affected by the force of legislative and
judicial events as this one.

Two strands can be discerned in the policy decisions 
which have led to the present demand for language 
assessment and language assessment instruments. One 
of these is accountability to funding sources or courts, 
and the other is determination or limitation of access to 
bilingual services. Policy decisions requiring language 
assessment arise from several sources: 

1. Congressional legislation,

2. Court decisions,

3. State mandates.

We shall examine each of these in turn below.

Congressional Legislation

The original Bilingual Education Act (PL 90-247), which
was signed into law by President Johnson in 1968, was
the first major piece of federal legislation to recognize
the special educational needs of children of limited
English-speaking ability. By providing funds for
demonstration programs, which were required to
conduct evaluations, the Act created a limited demand
for language assessment instruments to measure
student progress.

The 1974 amendments to the Act (PL 93-380) restated
its aim as being to establish equal educational
opportunity by supporting programs designed to enable
students, while using their native language, "to achieve
competence in the English language" (Sec. 702a).
Discussions about "maintenance" programs to the
contrary notwithstanding, the attainment of
competence in English was specifically one of the declared
goals of federal policy in providing program assistance
for limited English-speaking students, with the use of
the native language merely serving as a means to that
end.

The 1978 legislation (PL 95-561) narrowed this focus
even further by defining a bilingual program as one in
which instruction was designed "to allow a child to
achieve competence in the English language" (Section
703a). This should have dispelled, though it did not,
any illusions or misconceptions, either by opponents
or proponents of bilingual education, that the
legislation supported maintenance of other languages
or linguistic separatism. The title "Bilingual Education
Act" was clearly a misnomer; it would have been more
accurate to refer to it as the "English Assimilation Act"
since it was designed, in fact, primarily as an instrument
for linguistic assimilation.

The 1978 amendments also explicitly revised and 
expanded the designation of the target population in 
two ways: (1) by changing "limited English-speaking 
ability" (LESA) to "limited-English proficiency" (LEP), 
which included reading and writing, and (2) by allowing 
the inclusion of students from non-English language 
background (NELB) homes, even though the students 
might be, at least superficially, fluent English speakers. 
The latter provision enabled American Indian
communities which had largely lost their native
languages to apply for Title VII funds, since research 
had shown that their English was still limited in ways 
that caused academic difficulties. 



The emphasis on English in Title VII has continued to
the most recent steps toward reauthorization, with a
proposed Administration bill which would specifically
allow funding for nonbilingual instruction in English,
and another bill by Sen. Huddleston, which would
require program exit based on English-proficiency test
scores.

Although at no time have evaluation guidelines for Title
VII-funded programs required English-proficiency tests,
these have widely been used as a means of assessing
program effectiveness. The further use of English
proficiency to screen students in program intake was
given a strong impetus by the Congressionally adopted
limitation on percentages of non-LEP students included
in programs, a provision sparked by the finding of the
American Institutes for Research national evaluation
(Danoff, 1978) that large numbers of (apparently)
competent English speakers were enrolled in Title VII
programs. Since lack of English proficiency had been
the motivation for offering bilingual education in the
first place, it should not be surprising that the
attainment of proficiency in English should be seen as
the basis for qualifying or not qualifying for inclusion in
a program. At the same time, the move to adopt English
proficiency as a criterion for program exit, as part of a
policy of limiting bilingual services, also appears
plausible, though research (Cummins, 1980a and b;
Troike, 1978) suggests that it is only speciously so.

The Civil Rights Act of 1964, under Title VI of the act,
forbade discrimination on the basis of race, color, or
national origin in the operation of federally assisted
programs. In an important memorandum issued on
May 25, 1970 by the then Director of the Office of Civil
Rights, J. Stanley Pottinger, the protection of the Act
was extended to cover children who were "deficient in
English language skills" by virtue of their national origin
minority background. Schools were required to provide
equal educational opportunity to such children and
were forbidden to discriminate on the basis of language.
This broadened interpretation of the Civil Rights Act
brought about compliance efforts on the part of OCR,
though they were not strongly pursued until the Lau
decision by the Supreme Court gave them added force.

Court Decisions 

The most far-reaching court decision was that by the
Supreme Court in the case of Lau vs. Nichols, which
held that the Civil Rights Act (and the OCR
interpretation of it) was being violated by failure to
provide special assistance to students of
limited-English ability. The opinion of the Court, written by
Justice Douglas, observed that "Basic English skills are
at the very core of what these public schools teach . . .
students who do not understand English are effectively
foreclosed from any meaningful education."

Although the Court specified no remedy, the Office of
Civil Rights developed guidelines for enforcing
compliance with the decision, known as the "Lau
Remedies." These guidelines specified that school
districts found in noncompliance must identify
students' primary or home language and their
functional ability in the home language and English.
This requirement, more than any other, coming as it did
with the weight of compliance with a Supreme Court
decision, stimulated the national search for valid,
reliable, and easily administered instruments to classify
students by their English ability.

Specific court decisions, most notably the Aspira 
consent decree in New York City, have had the effect of 
reinforcing the requirements for language assessment 
at the local level. 

State Mandates 

Beginning with the State of Massachusetts in 1971,
states have passed legislation providing for or
mandating bilingual education. Many of these acts or
their enabling regulations specify language assessment
criteria for program placement, and some states also
specify criteria for program exit.

In Illinois, school districts are required by the State 
Board of Education to have a district assessment 
procedure plan which must specify the criteria and
instruments to be used, training for administration, and 
a description of procedures to be followed. The annual 
bilingual census classifies students into those 
categories according to their level of English. 

A number of states have mandated the use of particular
language tests which are considered acceptable, such
as the LAS (Language Assessment Scales), BSM
(Bilingual Syntax Measure), or BINL (Bilingual Inventory
of Natural Language). Most states require or encourage
multiple criteria for program placement or exit, though
in some cases English test scores are the sole criterion.

A recent departure, taken in California and Texas, is to
require the use of a standardized achievement test
given in English, rather than a language proficiency test
per se. (This move accords with my own belief that such
a test provides a far better integrative assessment of the
language necessary for successful achievement at a
given level than would an isolated language-focused
instrument.)

Discussion 

We have seen that the growth of language proficiency
testing for English, with notably lesser attention to
other languages, was initially stimulated by the
Bilingual Education Act of 1968 and given particular
impetus by the Lau decision and related court actions.
State mandates have added to the demand for tests,
though the withdrawal of the proposed Lau regulations
by the Department of Education in early 1981 and the
shift of some states to requiring standardized
achievement tests may signal that the demand has now
peaked and will begin to decline. However, possible
moves in some states and even nationally to tighten
exit requirements in terms of language proficiency,
being pushed most strongly by those who see bilingual
education primarily as an assimilative device, could in
fact increase the demand for tests.

Thus the market for English language assessment 
instruments has been created almost entirely by federal 
and state legislation and regulations and court 
decisions. (A few ancillary demands have appeared, 
such as the need for instruments and procedures to 
collect and validate census data on the number of 
limited-English-proficient persons of school age in the 
country, but this was generated by the same Title VII 
legislation in 1978). It is significant from the larger
social perspective, however, to recognize the lack of
interest nationwide in such tests on the part of major
testing organizations and marketers. Test development
and marketing in the area of language proficiency has
been largely a cottage industry, in spite of the size of the
potential market created by these legislative and
judicial actions. This situation results, I believe, from the 
deep-seated antipathy toward bilingual education 
which is endemic in the educational-industrial complex 
and which ultimately reflects the hostility toward 
cultural and linguistic differences that characterizes our 
profoundly xenophobic society. No other explanation 
on purely rational-economic grounds would account for 
this apparent lack of interest.

To return to the point with which we began, the
demand for language assessment has been created by 
the imposition of public policy decisions on the 
educational system. Had educational institutions and 
educators been appropriately concerned about the best 
interests of their linguistic minority students, such 
decisions would not have had to be imposed from 
outside. Policies requiring bilingual education have
been perceived (and denounced) as political because
they threaten the status quo, obscuring the fact that
the status quo itself is political. The existing power
relationships are fundamentally political and serve to
determine allocation of resources and policy
development within the system. However, these
matters are rarely viewed as political since it is not in
the interests of those in control to have them
recognized as such.

One unfortunate fact about educational policies 
developed outside the educational system is that,
desirable as they may be, they sometimes lack the 
research base or educational realism that policies 
which arise from informed educational practice might 
have. In this instance, educational practice was little 
better informed than legislative and judicial decision 
making. 



As I have suggested earlier, the whole policy of 
language assessment for program entry and exit is 
seriously flawed, for it gives unwarranted status to 
isolated language skills as indicators of ability to
"participate effectively in the educational program" (Lau
v. Nichols). As recent research has shown (Saville-Troike
et al., 1982; Cummins, 1980a and b), we in fact
know little about this relationship, certainly not enough
to justify some of the educational decisions which are
being made on the basis of language-proficiency assessment
alone. What is needed is closer cooperation
among policy-makers, researchers, and educators to
develop more rational, effective, and educationally
sound policies and procedures for their implementation.
In the meanwhile, educators should insist that
language measures alone are not enough for making 
educational decisions, and that other educationally 
grounded criteria must be used in addition, or even 
instead.

THE ISSUE OF LANGUAGE PROFICIENCY TESTING
AS A REQUIREMENT FOR
BILINGUAL EDUCATION TEACHER CERTIFICATION

Guillermo Duron

Introduction and Statement of the Problem

The issue of language proficiency requirements for 
bilingual education teachers has been one of 
controversy since the first Title VII programs were 
initiated. While no one will disagree that bilingual
teachers must possess some minimum level of
proficiency in the language or languages in which they
are expected to teach, disagreement arises when the
details are addressed. Various answers have been given
to the following questions:



1. What are the areas of language to be addressed in 
setting minimum requirements (oral, written, 
vocabulary, syntax, etc.)?

2. Should the minimum language proficiency testing 
be conducted in the language of instruction or in 
both the target language and English? 

3. Should the minimum language proficiency 
requirements be the same both in English and the 
target language of certification? 



4. Should the criteria vary according to target
language (i.e., less stringent requirements for
languages for which there is a teacher shortage)?



5. What criteria will be used to determine language 
proficiency? 

6. Who will set the minimum language proficiency
requirements? 

Several individuals and organizations have examined 
the issue of language proficiency requirements of 
bilingual teachers. A review of the relevant literature
reveals that publications in the area of teacher 
certification of bilingual/bicultural teachers tend to 
encompass general guidelines which state education 
agencies or institutions of higher education can use to 
develop their own policies and procedures. In addition, 
all of the states which offer certification or endorsement 
in bilingual education require some evidence of 
proficiency in a language other than English. 

In 1974 the Center for Applied Linguistics published a 
set of guidelines for teacher preparation and 
certification in bilingual/bicultural education. Included 
in these guidelines are language proficiency criteria 
which the authors felt should be required of 
bilingual/bicultural teachers. The guidelines indicated 
that bilingual/bicultural teachers should demonstrate 
the ability to: 



1. Communicate effectively, both in speaking and
understanding, in the languages and within the
cultures of both the home and the school. The
ability will include adequate control of
pronunciation, grammar, vocabulary and regional
stylistic and nonverbal variants appropriate to the
communication concept.

2. Carry out instruction in all areas of the curriculum
using a standard variety of both languages. (Center
for Applied Linguistics, 1974, pp. 2-3.)

The National Association of State Directors of Teacher 
Education and Certification has adopted several 
standards which they designated as being appropriate 
to all college programs preparing teachers in 
bilingual/bicultural education. Standard Two addresses 
the area of language proficiency. As stated, "The
programs shall develop and/or evaluate the ability of
the prospective teacher to function and instruct
students both in English and in the language of the
target student population with fluency and accuracy,
good pronunciation and intonation" (Waggoner, 1976,
p. 41).

Acosta and Blanco (1978) recommended that students 
completing a bilingual teacher education program 
should have a command of English and the target 
language in the areas of listening, speaking, reading and 
writing. They recommended that graduating students 
be able to provide instruction using standard language 
both in English and the target language as well as 
possess comprehension and speaking ability in the 
language variety of the student. They strongly
encourage the use of an exit language proficiency test
evaluating both general language skills and educational
terminology in both languages.

National Perspective 

In a survey conducted by the National Center for 
Education Statistics, Waggoner (1976) found that 
eleven states had adopted special requirements for
bilingual education teachers.* All eleven required
proficiency in a language other than English, while five
required proficiency in English (see Table 1).
Regulations concerning language proficiency
requirements varied from a single sentence indicating
that oral and written proficiency in the target language
was required to a full description of criteria used to
determine competencies in language skills.



*Delaware has since prohibited bilingual education.

Table 1

TYPE OF BILINGUAL LEGISLATION, CERTIFICATION AND
REQUIREMENTS FOR TEACHERS IN BILINGUAL/BICULTURAL PROGRAMS IN THE U.S.²

STATE              Legislation (Permissive/Mandatory)

Alaska             M
Arizona¹           P
California         M
Colorado           M
Connecticut        M
Florida            P
Idaho              P
Illinois¹          M
Indiana¹           P
Iowa               P
Kansas             P
Louisiana          P
Maine              P
Maryland           P
Massachusetts¹     M
Michigan¹          M
Minnesota          P
New Hampshire      P
New Jersey¹        M
New Mexico         P
New York           P
Ohio               P
Oregon             P
Pennsylvania       P
Rhode Island¹      M
Texas¹             M
Utah               P
Washington         P
Wisconsin          M

Bilingual Education Certification/Endorsement, entries in printed order (alignment with states not recoverable): C, C&E, C, E, C, C, C, E, E, E, C, E, C

Language Proficiency Requirement (English/Target), entries in printed order (alignment with states not recoverable): E&T, E, E&T, E, E&T, E&T, E&T, E, E&T, E, E, E&T

¹ Were found to require language proficiency according to Waggoner (1976).

² Information compiled from Waggoner (1976), Gray (1981) and Woellner (1982).



The most brief and general references made concerning 
language proficiency of bilingual education teachers 
were found in the regulations of the following states:

Indiana: "The candidate for the bilingual and
bicultural endorsement must show
oral and written proficiency in the
target language."

New Jersey: "The requirements are as follows:
demonstration of verbal and written
proficiency in English and in one other
language used as a medium of
instruction." (Waggoner, 1976, p. 29)



In the State of Delaware, language proficiency could be 
demonstrated through completion of a "minimum of 
fifteen semester hours in the language area at or above 
the third year college level or demonstrate fluency in 
the language area of assignment as determined by the 
NTE score at the 50th percentile" (Waggoner, 1976, p. 15).

Rhode Island also required a demonstrated proficiency
in speaking, reading and writing in a language other
than English. Proficiency would be attested to by the
Commissioner of Education at approved colleges or
universities in Rhode Island. In Arizona, proficiency in
the second language or English must be verified by the
language department of a regionally or nationally
accredited institution. The New Mexico Department of
Education included several specific requirements in its
regulations. Candidates in New Mexico must
demonstrate fluency in the local dialect as measured by
observation in the field as well as an extended
functional vocabulary of classroom terminology and
literary skills. In Texas, proficiency in English and the
target language at the highest level for which
certification is taught is required. Proficiency is
determined by a public school committee as designated
by the Texas Education Agency (Waggoner, 1976).

Figure 1

BILINGUAL TEACHER CERTIFICATION MODEL VARIABLES

State certificate; State endorsement; Minimum competency qualifications

Methods course; Materials course; ESL instr. course; Assessment course

Oral interview; Spontaneous language evaluation; Demonstration lesson

Norm-referenced standardized test; Criterion-referenced test

Simulation lesson; Videotape of actual lesson; Observation checklist


Waggoner (1976) describes California's requirements 
and states that target language proficiency must be 
verified in oral comprehension, aural comprehension, 
reading and writing. Proficiency is verified through 
successful completion of an examination covering each
of these areas. The examination may be administered
by the Commission for Teacher Preparation and
Licensing, a commission-approved California institution
or the governing board of a local educational agency or
county superintendent of schools.

Since the Waggoner study, several states have
introduced bilingual education legislation with
provisions related to bilingual teacher certification (see
Table 1). Maine, New Hampshire and Louisiana have
specified that bilingual teachers must possess skills and
demonstrate proficiency above and beyond that which
is required of general elementary and secondary school
teachers. The three states mentioned above have
utilized the bilingual teacher certification models
devised prior to 1975 with individual alterations. All of
the states currently requiring certification use one or a
combination of the variables listed in Figure 1 to
evaluate bilingual teacher candidates.




Illinois Perspective

In the State of Illinois bilingual educators and
certification specialists have been actively investigating
the area of language proficiency of bilingual teachers
for the past fifteen years. When the first bilingual
programs were initiated, teachers were desperately
needed to fill the newly created bilingual positions.
Since most of the first programs were created in
response to a need to serve limited-English-proficient
Spanish-speaking children, teachers were mainly
recruited from the ranks of Spanish language
departments. 

Requirements in the early years included fluency in the 
target language and residence of two or more years in a 
country where the target language is used. Another 
means for satisfying certification requirements was to 
have been raised in a home where the target language 
was spoken.

Any attempt to require minimum language proficiency
and to determine whether teachers met those
minimums was done informally at the district level.
More often than not, school districts did not have
anyone on staff who could make those judgments 
concerning the target language. As a result, teachers 
were hired for bilingual education positions who 
possessed varying degrees of proficiency in the target 
language and in English. 

No formal testing was conducted in the early years;
rather, school officials were asked to make use of the
advisory council in determining the language
qualification of teachers. Later, Office of Education
monitors would visit the schools and informally
evaluate the teacher's language proficiency (Seelye,
1976).

In 1976 the Illinois State Legislature passed the 
Transitional Bilingual Education Act (Illinois School 
Code, 1976). Included in this legislation were provisions 
for transitional bilingual education certification. This
was manifested as a nonrenewable six-year certificate
which was required of all teachers employed in
mandated transitional bilingual education positions.
The mandate required fluency in the target language
and communicative competence in English.



Following the adoption of the Illinois bilingual mandate,
procedures were developed to conduct language
proficiency testing of bilingual teachers. Candidates
were to be tested in English if they had graduated from
an institution of higher education where a language
other than English was used as the medium of
instruction. If the candidate had graduated from an
institution where English was used, the candidate was
tested in the target language. The assumption made
was that graduation from an institution was sufficient
evidence that the person was proficient in the language
used for instruction in the institutional setting.

In accordance with the Rules and Regulations for 
Transitional Bilingual Education (1976), testing was to 
be conducted only in the areas of speaking and reading.
Educational Testing Service's (ETS) Foreign Service
Institute (FSI) Survey was selected as the measurement
tool to be used in evaluating oral proficiency. Several
persons affiliated with the Illinois Office of Education or
the Bilingual Education Service Center were trained in
conducting and evaluating oral interviews by the ETS.
The Spanish Language Proficiency Test, published by
the Modern Language Association, was adopted for
evaluating Spanish reading ability and the
Nelson-Denny Reading Test was chosen to test reading
ability in English.

In an effort to determine the minimum proficiency level 
to be required for demonstrating fluency leading to 
certification, the State took language samples from 
employed bilingual teachers and evaluated their 
language abilities using the FSI Language Proficiency
Interview Scale (see Table 2). Teachers were examined
in both English and the target language and proficiency
levels were found to range from Level Two to Level Five
in both languages. Because of the shortage of teachers,
minimum proficiency was set at Level Two in English
and Level Four in Spanish based on the FSI scale. The
lower proficiency level required in English was justified
based on the contention that teachers conducting
classes all day in the target language need not be highly
proficient in English. Therefore, it was decided that
"communicative competence in English meant that
they (teachers) could just function rudimentarily in the
school building" (Seelye, 1976, p. 38).
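
The choice of testing language and the minimum levels just described can be summarized in a short sketch. The Python below is illustrative only: the function names and the dictionary are hypothetical, the levels are those reported in the text (Level Two in English, Level Four in Spanish), and it should not be read as an official statement of the Illinois procedures.

```python
# Illustrative sketch; all names here are hypothetical.

# Minimum FSI levels as reported in the text (Seelye, 1976).
MINIMUM_FSI_LEVEL = {"English": 2, "Spanish": 4}


def language_to_test(medium_of_instruction: str, target_language: str) -> str:
    """Pick the language in which a candidate is tested.

    Graduates of English-medium institutions are tested in the target
    language; graduates of institutions that used another language are
    tested in English.
    """
    if medium_of_instruction == "English":
        return target_language
    return "English"


def meets_minimum(language: str, fsi_level: int) -> bool:
    """Return True if the rated FSI level reaches the minimum for that language."""
    return fsi_level >= MINIMUM_FSI_LEVEL[language]
```

Under these assumptions, a candidate who graduated from a Spanish-medium institution would be interviewed in English and would need a rating of at least Level Two.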



Table 2

LEVELS OF PROFICIENCY OF THE LANGUAGE PROFICIENCY INTERVIEW
(Educational Testing Service, 1979)

LEVEL     STANDARD

Level 1   Able to satisfy routine travel needs and minimum courtesy requirements
Level 2   Able to satisfy routine social demands and limited work requirements
Level 3   Able to speak the language with sufficient structural accuracy and vocabulary to participate effectively in
          most formal and informal conversations on practical, social, and professional topics
Level 4   Able to use the language fluently and accurately on all levels normally pertinent to professional needs
Level 5   Speaking proficiency equivalent to that of an educated native speaker

Six institutions have been approved by the Illinois State
Board of Education to serve as testing sites which
administer the Transitional Bilingual Education
Certification Examination. Testing site approval has
been consistent with approval for certification
programs in bilingual education. The institutions which
now serve as testing sites are: Governors State
University, Illinois State University, Mundelein College,
Northern Illinois University, University of Illinois at
Chicago Circle, and Western Illinois University. In the
last three years, however, only Governors State
University and Northern Illinois University have offered
the examination on a regular basis.

Since the establishment of the first testing sites, 
procedures, instrumentation and standards have 
changed and/or vary from site to site. One institution 
requires testing in both English and the target language, 
while the others only test in one language. Although the 
MLA and the Nelson-Denny are used at most 
institutions, the Prueba de Lectura and
criterion-referenced tests are also used by approved 
sites. Since most of the testing sites were established 
several years after the original procedures were 
introduced, many of the individuals involved in testing 
did not take part in the training provided by ETS. In 
1980, the State Board sponsored a special training 
session conducted by ETS. However, not all individuals 
involved in interviewing and evaluating oral proficiency 
were willing or able to participate. The fact that all test 
evaluators and interviewers have not received the same 
training may result in discrepancies and less than
perfect reliability in the evaluation results. 

The Illinois State Board of Education is presently 
engaged in the process of changing certification 
requirements for Transitional Bilingual Education 
Certification. Included in the changes are requirements 
in coursework in bilingual education as well as 
language proficiency. It seems, therefore, that this 
would be an ideal time to reevaluate the existing 
language testing procedures and implement changes 
which would ensure that: 

a. certified bilingual teachers are available in 
sufficient numbers in all languages,

b. the language ability of newly certified teachers 
is sufficient to enable them to function in 
bilingual classrooms, and 

c. teachers who can function in bilingual
classrooms are able to pass the testing criteria. 

In addressing these issues, consideration must be given 
to the questions presented in the opening paragraph of 
this paper. Each of these issues will now be discussed 
and suggestions made concerning implementation of 
changes in language testing procedures.

Language Testing Issues and Implications

• LANGUAGE AREAS TO BE MEASURED 

In order to convey a sense of the elements of language
proficiency testing, Figure 2 includes a graphic
representation of the components of language.
Theorists conclude that language is composed of four
elements. They include: phonemes (sounds), lexicon
(vocabulary), syntax (grammar) and semantics
(meaning). These elements are communicated through
expressive and receptive modes and can be conveyed
through vocal or graphic channels. The testing of
language proficiency should measure all elements
(directly or indirectly) in both modes and both channels.
Thus, a bilingual teacher candidate should be tested in
the areas of understanding, speaking, reading and
writing utilizing instruments which measure the
phonemic, syntactic, lexical and semantic elements of
language. Since these are usually integrated in
language use, separate tests need not be administered
for each area. However, instrumentation should be
analyzed to insure that all elements are measured in the
total test battery. These procedures would insure that
successful candidates have an adequate sound system,
sufficient vocabulary, proper use of grammar and an
understanding and ability to convey meaning when
using the language. These characteristics are basic to 
the communication which is needed to be successful in 
a bilingual education classroom. 
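
The check that a total test battery covers every element, mode and channel can be pictured as a simple bookkeeping exercise. The Python sketch below is offered only as an illustration: the function name, the data layout and the pairing of each skill area with a mode and a channel are assumptions made for the example, not part of any adopted procedure.

```python
# Illustrative sketch; names and data layout are hypothetical.

# The four elements of language named in the text.
ELEMENTS = {"phonemes", "lexicon", "syntax", "semantics"}

# Each skill area pairs a mode with a channel (assumed mapping).
SKILLS = {
    "understanding": ("receptive", "vocal"),
    "speaking": ("expressive", "vocal"),
    "reading": ("receptive", "graphic"),
    "writing": ("expressive", "graphic"),
}


def coverage_gaps(battery):
    """Return (missing_skills, missing_elements) for a test battery.

    `battery` maps an instrument name to a dict holding two sets:
    'skills' (skill areas it measures) and 'elements' (language
    elements it measures, directly or indirectly).
    """
    covered_skills = set()
    covered_elements = set()
    for instrument in battery.values():
        covered_skills |= instrument["skills"]
        covered_elements |= instrument["elements"]
    return set(SKILLS) - covered_skills, ELEMENTS - covered_elements
```

A battery made up of an oral interview and a reading test, for instance, would show writing as an unmeasured skill area even if all four elements were covered.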



Figure 2 
COMPONENTS OF LANGUAGE

• LANGUAGES TO BE TESTED

If a bilingual education teacher is to be successful, s/he
must be able to communicate with everyone s/he deals
with. Since interactions occur among monolingual
English speakers as well as persons who are
monolingual in the target language, the bilingual
education teacher must be proficient in both languages.
Graduation from an institution where a specific
language is spoken has not proven to be sufficient
evidence of a person's proficiency in the language used
in that institution. Similarly, college credits earned in
language courses cannot verify a person's language
capabilities. The testing of language proficiency in both
languages utilizing valid procedures with valid
standards appears, at this time, to be the best method
of assuring that a potential bilingual education teacher
has the skills necessary to be successful in a teaching
situation.

• LANGUAGE PROFICIENCY REQUIREMENTS IN
ENGLISH AND THE TARGET LANGUAGE

The level of proficiency required in terms of minimum
standards should be determined separately in each
language. These levels should be based on the language
behavior expected in each language. Reading
requirements should be based on the difficulty levels of
the material the teacher is expected to read in each
language. Similarly, the different contexts in which the
teacher must speak English and the target language
must be taken into consideration when determining the
minimum oral proficiency level required. These 
minimums can be determined separately through the 
procedures described in a later section of this paper. 

• VARIATION OF LANGUAGE PROFICIENCY 
REQUIREMENTS

The great shortage of bilingual education teachers that 
existed in the early 1970s no longer exists. 
Spanish/English bilingual programs for the most part 
are able to hire qualified, certified personnel. A shortage 
does, however, exist in some languages such as Hmong
and Laotian. The Illinois State Board of Education 
should maintain an annual record of such shortages and 
deem special status to those languages for which 
teachers are scarce or difficult to hire. Special 
temporary certification could then be granted to 
qualified persons in these languages. A qualifier to this 
certification would allow for the teacher to teach only in
the target language until passing the regular certificate
examination. At that time, individuals who did not
previously achieve the required cut-off score or above
would receive the appropriate credentials for bilingual
teacher certification. 

• CRITERIA USED TO DETERMINE LANGUAGE 
PROFICIENCY 

One of the most essential characteristics which all
testing procedures must possess is validity. "Validity
refers to the extent to which the results of an evaluation
procedure serve the particular uses for which they are
intended" (Gronlund, 1978, p. 79). If the results are to be
used to determine the ability of a person to function
linguistically in a bilingual classroom, then the criteria
should reflect the specific behaviors which are needed
to provide as accurate an estimate of future success as
possible. Testing procedures for bilingual education
certification purposes should attempt to evaluate the
candidates' language proficiency in relation to the
following:

1. Ability to read texts and reference manuals; 

2. Ability to communicate with students, parents, 
teachers and administrators; 

3. Ability to present oral instruction in a clear and 
understandable manner;

4. Ability to write to parents, teachers and
administrators;

5. Ability to write original content for lessons.

The above criteria should be used in selecting
instruments and in establishing the total test battery.
Measures should be used to obtain an estimate of the
candidate's understanding, speaking, reading and
writing ability within the social and educational
contexts in which the language will be used. Although
actual observation of the candidate within these
environments would not be feasible, the content of the
instruments must reflect an adequate sample of the
universe of language behavior required of bilingual
teachers. 

• SETTING THE MINIMUM LANGUAGE 
PROFICIENCY REQUIREMENTS 

Livingston (1978) described an eight-step procedure for
conducting an empirical standard-setting study. This
procedure can be outlined as follows:

1. Identify the behaviors to be evaluated.

2. Identify valid measures of those behaviors.

3. Identify judges qualified to evaluate the
performance of an individual.

4. Select a sample from the population of persons
for which the standard is to be set.

5. Obtain evaluations from the judges.

6. Analyze the data and establish the probability
of error.

7. Consider the seriousness of the two types of
errors: passing a candidate whose performance
is inadequate and failing a candidate whose
performance is adequate.

8. Establish the standard at a point determined by
the acceptable probability of reducing the two
types of errors listed in #7 above.

Language proficiency requirements in Illinois should be 
established utilizing these procedures. The behaviors to 
be evaluated are the language areas mentioned earlier 
in this paper. Specific tests, including modification of 
the FSI meeting the criteria listed in THE CRITERIA 
USED TO DETERMINE LANGUAGE PROFICIENCY 
section of this paper, should be identified and 
developed. A group of judges can be identified which
are representative of all persons qualified to evaluate
the adequacy of a candidate's language proficiency.
These may include bilingual teachers, coordinators,
principals, education specialists and teacher trainers
fluent and literate in the language to be addressed.

The scores from the language measures selected can
be collected from a sample of potential candidates at
one of the present testing sites. The data from this
sample can then be analyzed following the
acceptability judgments technique described by
Livingston (1978). These procedures determine the
probability of errors [...] scores. Once the
seriousness of the two types of errors is considered and
an acceptable probability of error determined, this
information can be used to set the minimum cut-off
scores for the various measures.
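
As a rough illustration of the last three steps of the standard-setting procedure, the following Python sketch tabulates the two types of error at each possible cut-off score and picks the score with the smallest weighted error. It is a simplified stand-in under stated assumptions, not Livingston's (1978) technique itself: the function names, the cost weights and the sample data are hypothetical, and each candidate is assumed to carry one score and one pooled judgment of adequate or inadequate.

```python
# Illustrative sketch; names, weights and data are hypothetical.


def error_rates(scores, adequate, cut):
    """Return (false_pass_rate, false_fail_rate) for a cut-off score.

    false pass: judged inadequate but score >= cut
    false fail: judged adequate but score < cut
    Both rates are proportions of the whole sample.
    """
    n = len(scores)
    false_pass = sum(1 for s, a in zip(scores, adequate) if s >= cut and not a)
    false_fail = sum(1 for s, a in zip(scores, adequate) if s < cut and a)
    return false_pass / n, false_fail / n


def choose_cut(scores, adequate, cost_false_pass=1.0, cost_false_fail=1.0):
    """Pick the observed score that minimizes the weighted error.

    The two cost weights express how serious each type of error is
    judged to be; ties go to the lower score.
    """
    best_cut, best_loss = None, float("inf")
    for cut in sorted(set(scores)):
        fp, ff = error_rates(scores, adequate, cut)
        loss = cost_false_pass * fp + cost_false_fail * ff
        if loss < best_loss:
            best_cut, best_loss = cut, loss
    return best_cut


# Hypothetical sample: eight candidates with scores and pooled judgments.
scores = [1, 2, 2, 3, 3, 4, 4, 5]
adequate = [False, False, False, False, True, True, True, True]

# Weighting a false pass twice as heavily raises the cut-off to 4.
print(choose_cut(scores, adequate, cost_false_pass=2.0))
```

Weighting a false fail more heavily instead moves the cut-off down, which is the trade-off a state facing a teacher shortage would have to weigh.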

Conclusion 

The state of the art of language proficiency testing for
bilingual teacher candidates is one which can be
characterized by diversity, flexibility and fluctuation.
The impact of federal and state legislation which
ultimately dictates whether bilingual programs will be
mandatory or permissive greatly affects the status of
bilingual education teacher certification. Fluctuation in
public opinion which is reflected by legislative
manifestations contributes to the many changes that
have been experienced by states since the mid-1970's.

Language proficiency testing for teacher certification is
developmental in most states which combine one or
more variables to achieve the model best suited to local
needs. These variables include university coursework,
English and target language fluency, English and target
language literacy, and classroom teaching competency.

In one state, Illinois, bilingual teacher certification has
changed considerably over the past several years, and
combinations of the above variables have been
operationalized. Presently, efforts are underway to
reexamine certification criteria using a multicriteria
approach. Each state must look to its own requirements
and resources to determine the best certification plan.
However, a plan that incorporated the language testing
issues discussed in this paper in response to the
questions found in the introduction would have
universal application for bilingual education teacher
certification.

CURRICULUM PLANNING

SOME RECENT DATA AND MODELS

Fordham University

are 



Durces 
Education 



teacher tralnftiglpro'grams within institutions of higher 
' l 'uca^nJi^m^-iA^7^97^ (Binkley et'al., 19811.. 
ttie" -/ |hfgrmatipn A ;.of/>ekplariato'rY and useful value is 
yaij^bi^to address th^se two crucial questions: (1) 
h^%ntic^||./factor.fs) , influence the process of. 
s t rt 0 1 ion a I j i q n within an IHE?, and (2) How^can the 
r dcess)of institutionalization endure in the period* of 
austerity"? Based upon a . variety of 
' ^appiipjjbhes^'^ploying literature' survey 'materials, 
V sg1^fe(Jf,corjiS^pts £6rr<}wed from educational change 
thebr^a^vcurrertt conceptual frameworks, my pap^r 
f : ;wi^ answers. to these important 

;l' 'qi^lres' )9je^alrifing *jd the past, present, and future efforts 
*."V3*t -^insjtitUfjb.i^li-zing' biling ual education within higher 
United States poignantly charts the •• decfj^^.t federal ^ Vfed □ fpti - : ;l p» - 1 h i ^ p a p e r. vi/istitutibnalization" refers, to 
support for -Title Vll^btlingual programs. Ijpr e^al^ple, whereby committed and 

singe? 19§1, Federal support for, basic varain t s. ? -. t ratrfihg ^PQ^^ 0 ^^^^!^ j^S™ ber£'Qf the formal higher education 
grants.^hd support services has plungec^ 4ljpeFce ; nt/^^ systematically integrate 

* " " ^ *$*\ " j- u;^.^' 1 ^^^ "'^TSP 6 rti !^ of .the-'.field: of^bili.cjgual- 'education, 



While retrenchment and scarcity of 

'rapidly changing the formal ' higfte 
landscape.", the .prpcessSpf JnstitutT 
inno^ftion -such. . as^ ; b j !|n gLmt^ bd M ca t i bij ^ittlTn^^ 

. posi~3£^ the^Vfears ahead" 

• heeds c^ 

to the cbrtsGquences of ecofioiAjc de^fg^£^|fie v ;N§w r ™'™ 
♦Austerity^ which forces us to ask ag a l^^^^^est^n 
What is the. purpose" of education? \n p^^^f; 
the puj^ose^f teacher 'and ^rofpssi&Ml- edti(iL^ f 
♦programs' within formal higher , ecfuc9tmn? : * A; J^silite 
publication froh^the* Office of Bilingual jgtiUcation 
^Minority Language Affairs (OBE^/^\*^^\e^'Baybni 
.the Myths: Title VII a no * Bilingual rEdligqtion in th$\ 




What appears to be happening today is
captured by Schneider (1976, p. 162) in the
and interpretation of the 1974 Bilingual Ed



Scenario : 



Jor^ the 



as,bnSj©f 4 

V.. m . 4? ■ 



5volutipn/reactiort»oV reform; 



speqifffc 'purpose . of preparing bilingual 

telligently-'and 
ty and majority 



)$Qt : ^ prpfe^srdnajs ..'and'* specialists to .;in 
. .^'i- ^ : sef»sitivei^^.^ryi^e lariguag^ minority 



& ^reaqtipnary/ piece^ of legislation wojtil 
. been t We Vthat* totally* detetetfjhr 

pfo^r-afip'' 'or'., one "whiQih- adlbpjte^ ^ 
; -:V$dmu^mfioh ; s initial pra^os^l^of f(^m^ttl6." 



students; 



VII .;ahd . o'tber ■.•qategoricUv P.teS'nara^j^^ a ' ^ 

l^qacfi* iw&^t r 



special, revenue .sharing 'ap 



instance/, the 



onus for bilingual-bicultural
education programs would have been removed
from the federal government, since the
decision on the level of expenditures for
bilingual-bicultural content would have been a



proposal to meet the problems of the limited
English-speaking not with a bilingual-bicultural

second language without any mandated
culture content (Emphasis added).


Although Schneider's analysis discounted this
reactionary result in the case of the 1974 Bilingual
Education Act, recent events (e.g., the Administration's
attempt to convert Title VII into a block grant, the
steady drop in [...] to amend current

bilingual education legislation [...] and de Kanter,

1981) attest to the major shift in philosophical and
pragmatic emphases among important policy and
decision makers.

With regard to the topic of this paper, Curriculum
Planning and Institutionalizing Bilingual Education,
the literature and research are practically nonexistent.
Except for scattered papers and symposia at national
and regional conferences (See the bibliography for a
listing.) and a recent, descriptive survey of bilingual



ISSUE ONE: What critical factor or factors influence
the process of institutionalization within
IHEs?

This question has been a major concern of OBEMLA,
IHEs, and, in particular, directors of bilingual programs
who have been involved in bilingual programs since the first
federal [...] available to post-secondary
institutions under the 1974 Bilingual Education Act.
Section 723 of the 1974 Act provided the impetus for
the issue:


"(A) (i) training, carried out in coordination with
any other programs training auxiliary
educational personnel, designed (I) to prepare
personnel to participate in, or for personnel
participating in, the conduct of programs of
bilingual education, including programs
for career
development, advancement, and lateral
mobility, (II) to train teachers, administrators,
paraprofessionals, teacher aides, and parents,
and (III) to train persons to teach and counsel
such persons, and (ii) special training programs
designed (I) to meet individual needs, and (II) to
encourage reform, innovation, and improvement
in applicable education curricula in
graduate education, in the structure of the
academic profession, and in recruitment and
retention of higher education and graduate
school facilities, as related to bilingual
education..." (Public Law 93-380, Sec. 723, cited
in Schneider, 1976, p. 218).

Table 1


INDICATORS OF PROGRAM INSTITUTIONALIZATION
(Adapted from Binkley et al., 1981)

INDICATOR 1. Administrators actively support program.
COMMENT 1. Manifested through allocation of institutional
funds, tenure, formation of bilingual advisory
committees by deans, "professional regard" in
which director and bilingual education faculty are
held in the IHE.

INDICATOR 2. Most nonbilingual [...]
toward program, or at least do not create obstacles
to institutionalization.
COMMENT 2. Although administrator support is more critical to
survival, negative attitudes of a large number of
nonbilingual faculty can be detrimental; an area of
serious concern as formal higher education
"downsizes" in the years ahead.

INDICATOR 3. Some faculty are supported by institutional funds.
INDICATOR 4. Some faculty are on tenure track.
COMMENT 3 & 4. Related to indicator #1; more obvious indicator
of institutional commitment.

INDICATOR 5. Program is reported to continue in absence of Title
VII funds.
COMMENT 5. "Expressed opinions are held opinions";
administrators/directors report continuation of
program without Title VII funds.

INDICATOR 6. Program operation involves the efforts of several
professional staff and is not dependent on one or
two prime movers.
COMMENT 6. This point is debatable, depending upon the size of
institution and degree to which program is coupled
with existing university structures. However, one or
two faculty members must be responsible for the
management of the program along with its
academic quality.

INDICATOR 7. Program is compatible with established
institutional priorities.
COMMENT 7. Degree to which program's goals correspond to
traditional purposes of a university (instruction,
research, service); likewise, state bilingual
education certification legitimizes the existence of
the program within the School of Education.

INDICATOR 8. Enrollment levels would be high enough to sustain
the program.
COMMENT 8. Probably the most important indicator of program
institutionalization. Level of Title VII support of
tuition and financial support at the institution,
geographic proximity of competing Bilingual
Education programs, and state certification all
differentially influence enrollment; another area of
consternation if Title VII support for training is
reduced.



Table 2

VARIABLES INFLUENCING INSTITUTIONALIZATION
(Carey and Marsh, 1980)

Characteristics of the Project

1. Variable: The extent that the proposal
   includes the innovation as an aspect of project [...].
   Scale (high): The innovation is an explicit
   goal for the project as stated in the proposal.
   Scale (low): The innovation is not
   mentioned nor implied in the project proposal.

2. Variable: Proposal emphasizes
   institutionalization of the innovation as goal.
   Scale (high): Institutionalization of the
   innovation is an explicit goal for the project.
   Scale (low): Institutionalization is
   not mentioned nor implied in the proposal.

3. Variable: The innovation is highly [...]
   with other project activities ([...] practice).
   Scale (high): The innovation is highly
   interconnected with other project activities.
   Scale (low): The innovation is not
   connected to other project activities.

Characteristics of the Institutionalizer

1. Variable: The institutionalizer [...]
   Scale (high): The key institutionalizer is a
   member of the core project staff.
   Scale (low): The key institutionalizer
   was not affiliated with the project.

2. Variable: The institutionalizer [...]
   Scale (high): The key institutionalizer is a
   tenured faculty member.
   Scale (low): The key institutionalizer
   is a soft-money project
   person on the tenure track.

3. Variable: The amount of project staff
   time given to implement
   the innovation within the project.
   Scale (high): Extensive staff time was given
   to implementing the innovation.
   Scale (low): Very little staff time was
   given to implementing the
   innovation within the project.

4. Variable: The amount of project staff
   time given to institutionalizing
   the innovation.
   Scale (high): Extensive staff time was given
   to institutionalizing the innovation.
   Scale (low): Very little staff time was
   given to institutionalizing
   the innovation.

Characteristics of the Innovation

1. Variable: The innovation is an adaptation
   of innovation developed/created
   outside the local setting.
   Scale (high): The innovation came from outside
   with no local revision.
   Scale (low): The innovation was entirely
   locally developed.

2. Variable: The innovation is a tangible
   product or program (degree
   of transportability).
   Scale (high): The innovation is a tangible,
   easily transportable product.
   Scale (low): The innovation is an amorphous
   entity which would be awkward
   to transport.



Characteristics of the Institution

1. Variable: The institutional leadership
   wants to institutionalize
   the innovation.
   Scale (high): Leadership provided funds,
   extensive support, and
   expressed strong interest
   in the innovation.
   Scale (low): Leadership did not provide
   funding; no expression of
   interest in innovation (or
   negative reaction).

2. Variable: Colleagues value and want to
   institutionalize the innovation.
   Scale (high): Colleagues highly valued or
   wanted innovation.
   Scale (low): Colleagues were neutral
   or opposed innovation.

3. Variable: The innovation does not
   violate the turf of colleagues.
   Scale (high): No intrusion on existing
   faculty's turf.
   Scale (low): Extensive intrusion on
   existing faculty's turf.

4. Variable: The innovation has the
   promise of generating
   significant levels of revenue.
   Scale (high): Innovation more than pays
   for itself when used by institution.
   Scale (low): Innovation cannot pay for
   its use by institution.

5. Variable: The innovation is easily
   translated into the
   administrative building
   blocks of the institution
   (courses, programs,
   credentials, departments).
   Scale (high): Innovation explicitly framed
   in terms of IHE courses,
   program requirements or credentials.
   Scale (low): Innovation very difficult
   to translate into IHE
   courses, program
   requirements or credentials.



In a subsequent study, Carey and Marsh (1980) used a
stepwise regression to examine the relationship
between the model's predictor variables, i.e.,
characteristics of the project and institution, and the
achieved level of institutionalization for each Teacher
Corps innovation. Their findings indeed shed some light
on the issue of critical factors influencing educational
change. These four predictor variables, accounting for
70% of the variance, were strongly associated with the
achieved level of institutionalization. (They are listed in
their order of importance.)



1. Colleagues value and want to institutionalize
   the innovation.

2. The amount of project staff time given to
   implementing the innovation within the project is
   important.

3. The institutionalizer is a member of the regular
   faculty.

4. The innovation is an adaptation of innovation
   developed/created outside the local setting.
   (pp. 38-39)
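
A stepwise analysis of the kind reported by Carey and Marsh can be sketched roughly as follows. This is a minimal illustration only, assuming a forward stepwise least-squares regression; the data are synthetic, and the variable names, weights, and sample size are invented for the sketch and are not taken from their study.

```python
import random

def solve(M, v):
    """Solve M x = v by Gaussian elimination with partial pivoting."""
    n = len(v)
    aug = [row[:] + [v[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x

def r_squared(rows, y, cols):
    """Share of variance in y explained by the chosen columns plus an intercept."""
    A = [[1.0] + [row[j] for j in cols] for row in rows]
    k = len(cols) + 1
    AtA = [[sum(a[i] * a[j] for a in A) for j in range(k)] for i in range(k)]
    Aty = [sum(a[i] * yi for a, yi in zip(A, y)) for i in range(k)]
    beta = solve(AtA, Aty)
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - sum(b * ai for b, ai in zip(beta, a))) ** 2
                 for a, yi in zip(A, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot

def forward_stepwise(rows, y, names, steps):
    """At each step, add the predictor that raises R^2 the most."""
    chosen, history = [], []
    remaining = list(range(len(names)))
    for _ in range(steps):
        best = max(remaining, key=lambda j: r_squared(rows, y, chosen + [j]))
        chosen.append(best)
        remaining.remove(best)
        history.append((names[best], r_squared(rows, y, chosen)))
    return history

# Synthetic data: hypothetical predictors and invented weights.
random.seed(0)
names = ["colleague_support", "staff_time", "regular_faculty",
         "outside_adaptation", "unrelated_variable"]
weights = [1.0, 0.7, 0.5, 0.3, 0.0]
rows = [[random.gauss(0, 1) for _ in names] for _ in range(200)]
y = [sum(w * x for w, x in zip(weights, row)) + random.gauss(0, 0.8)
     for row in rows]

history = forward_stepwise(rows, y, names, steps=4)
for name, r2 in history:
    print(f"{name:20s} cumulative R^2 = {r2:.2f}")
```

The cumulative figure printed at the last step plays the role of the "variance accounted for" in the text; the order in which predictors enter corresponds to an "order of importance" only under the assumptions of this sketch.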



These results clearly indicate that each category of their
model (characteristics of the project, of the
institutionalizer, of the innovation, and of the institution)
made an independent contribution toward
accomplishing successful institutionalization. Among
the implications of their studies, these three stand out:
(1) the process of institutionalization entails a cluster of
factors about the innovation, the institutionalizer, and
the institution, respectively, and neither one single
variable nor cluster of variables in their findings
explained the total variance; (2) individuals interested
in increasing institutionalization can manipulate these
variables with some assurance that levels of
institutionalization can be increased; and (3) the
importance of organizational [...] in any effort to
innovate.

On the basis of this review of the research and literature
pertaining to critical factors that influence
institutionalization within an IHE, one can conclude that
a combination of factors needs to be identified.
Although considerable overlap is evident in the three
models presented, they all negate the fallacy of
conceiving of institutionalization as a single, valued
process. Further research is necessary to empirically
verify any aggregate of factors (See Note 1). I
recommend that IHEs explore these critical factors
along with others not mentioned within their own
geographic settings, the extent to which they have
institutionalized their bilingual programs, and ways to
sustain student enrollments in an era of economic
decline. The latter remark leads directly into the next
issue of this paper.


ISSUE TWO: How can the process of institutionalization
endure in the period of the
"new austerity?"

This issue has many "angles" and "rough edges" to it,
e.g., the attitude of institutional decision makers toward
the meaning and effectiveness of bilingual education, the
beliefs of bilingual faculty concerning their role and
future within formal higher education, the extent to
which the Federal government will downsize training
grants in the next five years, how the State will respond
to and advocate the need for qualified bilingual
personnel, and the images we have of schools and the
types of change that correspond to each (Firestone,
[...]). The question also assumes that some form of
bilingual education will persist, thereby raising the
choice of strategies that would be most successful, e.g.,
employing a "cooperative" strategy, consolidating
current gains, appealing to private and corporate
funding authorities, etc.



While the issue of whether or not bilingual education
will survive in post-secondary institutions at all in the
absence of Title VII funds is an important one,
ultimately, its resolution will depend upon the
commitment, capacity, and competence of
individuals who belong to the institution.

What steps, then, might be considered .to ma* 
/plbcess of institutionalization tfciat will be a vi J 
the education of language minority and #ma 
sf&dents in this country? Since 1 do not ha' 
crystal ball that spq L 4&^^ifc^ipnpwing 'pa^acel*^^ ^ 
Easy Ways tb Gu|?0^ Occdpatforf 
■ r "in the Lean .Yea r.^^&f^g pnl^e'^r ibe some of 
my own t h oug h$"a hifea^ sen t ty make 




coupled systems" connotes the degree of bureaucratic,
heavily rationalized linkages that are easily located
within the institution, e.g., the bilingual teacher
education program with its directors, staff and other
visible organizational elements. Another positive
feature of this perspective is that it points out the
fallacy that institutionalization can follow traditional
models of educational innovation. To this writer, the
image of "loosely and tightly coupled systems"
expresses the fluidity and structural stability that
characterize this process. Since 1975, the bilingual
faculty at Fordham has been successful in "coupling"
with the College at Lincoln Center to establish an
undergraduate bilingual teacher education program;
with the Division of Administration, Policy and Urban
Education of the School of Education, to provide a
professional diploma program for bilingual
administrators and supervisors; with special education,
to develop a bilingual special education master's
degree; with the Division of Psychological and
Educational Services, to establish a bilingual urban
school psychology program; and finally, within the
Division of Curriculum and Teaching, to provide a
master's degree in bilingual bicultural education. The
concept of "loosely coupled systems" allows one to be
on the "lookout" for unexpected and potentially
promising bonds. See Appendix A, "Fordham University
Higher Education Programs," for more information
(Justiz et al., 1980, pp. 157-160).

What is important about Step I is that each
institutionalizer make explicit his/her own
understanding of institutionalization, along with
"significant others" within the institution.



Step II. Secure approval and certification of your
[...] program(s).



tj^t < A s 

Sfn?fe. ; gs. T bh'e^pu9ht not 
iV- 3or\p fenced" change ; 



sense of the 
Firestone (198£ 
assume a univer^*j$f 

'projects must b<^f^ m 
which they will be used." Hopefully, my ideas will assist
readers in their own deliberations. The bibliography can
be consulted for other sources such as Arciniega (1980)
and Carey and Marsh (1981). I recommend these few
steps, no priority given to them:

Step I. View the process of institutionalization as an
adaptive socioeducational process combining
the best elements of loosely and tightly coupled
systems within the academic system of the
institution.



In my opinion, this view simply responds to Weick's
(1980, p. 1) observation that "parts of the organizations
are heavily rationalized, but many parts also prove
intractable to analysis through rational assumptions."
The image of "loosely coupled systems" captures the
latter whereby structures/elements within the
institutions, e.g., bilingual education and administration,
preserve their own identity, uniqueness, and
separateness over a period of time. The image of "tightly



Besides legitimizing the program within the institution,
this step just might assure future enrollments. At
Fordham, we have recently decided to "tightly couple"
with the College at Lincoln Center and therefore to
initiate a four year liberal arts program with certification
in bilingual elementary education. We've come to
realize that bilingual education at the elementary level
is absolutely necessary. My own research (Baecher,
1981, 1982) and that of Cummins (1981) pertaining to
the cognitive learning styles of bilingual children and
the developmental aspects of bilingual proficiency,
respectively, point to the need for early bilingual
intervention in learning and augmentation of cognitive
styles. And one place to begin is at the pre-service level,
adequately and forcefully documented in Gaspar (1981)
dealing with Hispanics and the U.S. economy, and the
Research Bulletin of the Hispanic Research Center
describing Hispanic American diversity (1981).

Step III. Don't abdicate your role as bilingual educators
and leaders within your institution.

Vast transformations have been predicted for the
educational system in the years ahead. While these
changes are inevitable and possibly beneficial, thereby
impacting bilingual programs and their curriculum

Another indication that the IPT ignores characteristics
of L2 learners is the fact that language proficiency
levels are correlated with grade levels. While it is possible
to correlate a child's L1 language development with
his/her grade in school, such a correlation in the case of
the L2 learner does not reflect the same amount of
exposure to the language. A child who is a new arrival
and who is beginning to learn English in the 6th grade
cannot be expected to have the same language skills as
a native English-speaking child who is in the same
grade. Furthermore, such a system fails to accurately
distinguish among L2 learners in the same grade who
may have differing amounts of exposure to English due
to their date of arrival in the United States in the case of
older immigrant children, or due to sporadic school
attendance in the case of migrant children.
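
The point can be illustrated with a small worked example. The sketch below uses invented numbers, not data from the IPT manual: it assumes, purely for illustration, that a native speaker's proficiency level rises one step per grade, while an L2 learner's level depends only on years of exposure to English.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

# Hypothetical pupils, one per grade (all values are invented).
grades = [1, 2, 3, 4, 5, 6]

# Assumption: native speakers' level tracks grade exactly.
native_levels = [1, 2, 3, 4, 5, 6]

# Assumption: L2 learners arrived at different times, so years of
# exposure are unrelated to grade, and level tracks exposure.
l2_years_exposure = [1, 2, 1, 0, 2, 1]
l2_levels = [1 + years for years in l2_years_exposure]

r_native = pearson(grades, native_levels)
r_l2_grade = pearson(grades, l2_levels)
r_l2_exposure = pearson(l2_years_exposure, l2_levels)

print(f"native speakers, grade vs. level:  {r_native:.2f}")
print(f"L2 learners, grade vs. level:      {r_l2_grade:.2f}")
print(f"L2 learners, exposure vs. level:   {r_l2_exposure:.2f}")
```

Under these assumptions, grade predicts level almost perfectly for native speakers and hardly at all for L2 learners, for whom exposure is the relevant variable.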

Construct Validity 

The IPT has even more serious problems of a theoretical
nature in that the construct which it purports to
measure is not clearly defined. While the IPT is labeled
as an oral language proficiency test, the word mastery
is used interchangeably with proficiency throughout.
The point is made that linguistic competence must be
inferred from linguistic performance data and that the
constructs which underlie linguistic performance and
upon which the IPT is based are the following: a)
language is developmental; b) it is incremental; c) it is
systematic; d) it is used to communicate in a social
context; and e) it involves both receptive and
productive skills (Dalton, 1979, p. A-3). It is implied that
some of the 5 constructs which underlie linguistic
performance, as described by the author, are inherent in
the IPT due to its format, which places the child in a
dyadic social situation with the examiner where the
child must comprehend and produce progressively
more difficult language. From the child's responses,
his/her linguistic performance is evaluated and the level
of language proficiency is determined. Dalton states:
"Validation of the above constructs can be inferred if
the data generated through the (construct validity)
study demonstrate positive correlation with the age and
grade of the student and the IPT level results" (Dalton,
1979, p. A-4).

The construct validity study was conducted on 364
students in a suburban school district in California. An
analysis was made of the correlation between age and
IPT results on forms A and B of the English version of the
test. The data were further analyzed after excluding the
scores of NEP/LEP students, which resulted in a sample
which was comprised of native speakers of English or
L2 learners who were fluent speakers of English. The
same procedures were followed to determine the correlation
between IPT results and the students' grade level. Not
surprisingly, the study obtained high positive
correlations between IPT results and age and grade
level with high levels of significance on both forms of
the test.

Having shown that a strong positive correlation existed
between the age and grade of native speakers of
English and language proficiency in their own language,
Dalton concludes that these data validate the
constructs which state that language is developmental,
incremental and systematic. In addition, the data are
also said to validate the use of the IPT as a method of
assessing oral language proficiency through linguistic
performance.

The fact that there is a strong positive correlation
between the age and grade in school of native speakers
of English hardly seems to be a worthy subject for
research. Furthermore, one has to wonder why the
construct validity study was conducted and what its
applicability as a rationale for a language proficiency
measure for L2 learners is. An even more basic problem
with the study is its definition of linguistic performance,
which is composed of vague generalities which are not
accurate. While language may be said to be systematic
and incremental in its development, one's linguistic
performance is not. Linguistic performance is a
speaker's use of his/her linguistic competence
(underlying knowledge of the system of a language) at
any one time. In a native speaker of a language,
linguistic performance may vary from one occasion to
the next due to such factors as memory, distractions,
shifts of attention, etc. (Maclay & Osgood, 1959, p. 24).

Language proficiency measures attempt to accurately
measure one's linguistic competence by inference from
linguistic performance data which is variable. Due to
this fact, a more germane construct upon which to
base a test for L2 learners is, what is the nature of
communicative competence? What knowledge must
one possess in order to produce grammatically correct
speech in situationally appropriate contexts? Such an
approach requires the articulation and validation of the
learner's stages of language acquisition in the areas of
syntax, morphology, lexicon, phonology, functional
language and sociolinguistic skills. A complete theory
of communicative competence would also have to
order and weight items to indicate which are the most
crucial for achieving intelligibility in a language. While
such a theory has not been fully developed to date,
researchers in the area of language testing are working
toward that end. Federal and state regulations which
require language proficiency testing for children in
bilingual programs have caused great strides to occur in
the field in recent years. Unfortunately, the IPT does not
reflect [...] theory and suggested practices in the
field.


Content Validity

Content validity is the [...]
representative sample of behavior in the domain to
be measured. The IPT purports to measure
mastery in the areas of syntax, lexicon, phonology,
morphology, comprehension and oral expression and
reports extremely high percentages of items which
correspond to and measure each one. This stems from
the fact that test items were scored for all of the above
categories which could possibly pertain to a response
and not to those which are reflected in the IPT scoring
system. For example, the item "Who is this?" which
requires the student to supply the lexical item teacher, might

CURRICULUM PLANNING:
IMPLEMENTING PROGRAMS THAT WORK 



Anne M. Kiefer
Bilingual Education Service Center 



ABSTRACT 



The design and implementation of language assessment
programs involve professional staff in a complex
change process. Recent studies of program design and
implementation suggest procedures which would
convert that complex change process into a more
specific series of steps toward a goal. They also provide
a series of recommendations for program planners.

This paper will apply those recommendations to
assessment processes and the decision making
involved in language programs for limited-English-proficient
students at a local school or district level. It
will make use of the theories mentioned above as well
as concepts of assessment procedures, effective
programs, inservice and staff development, in addition
to the core theories of language assessment which are
the focus of this institute.

The summary will include a series of recommendations
for an adaptive implementation process which takes
into account: the level of certainty involved in the
theories or technology of language assessment; the
levels of stability in the environment; the kind of
organizational structure (loosely or tightly coupled); the
level of conflict among persons involved in the process
(conflict over goals and means); and, finally, the scope
of the change involved in the new program.



INTRODUCTION 

The key to establishing programs of language
assessment that work is to establish a flexible change
process of implementation before beginning to design
the program.

Programs of language assessment that "don't work" are
those which are totally designed ahead of time with no
provision made for the process of change. They are
programs that have not allowed for the uncertainty and
ambiguity involved in language learning and language
assessment theory and technology. Their planners have
not made provision for the conflict resulting from that
uncertainty nor from differing ideas on teaching/learning
processes, on evaluation and on instruments
which are effective for assessment. These programs
have acted as if schools and school systems are as
tightly structured as are business organizations with
each unit operating as a part of the whole. Thus they
have not acknowledged educational organizations as
"loosely coupled" (Weick, 1982), nor have they made
provisions for the resulting best practices. The
"not-working" programs have not examined the context
of the environment nor its stability or lack of stability.
Finally, they have not analyzed, acknowledged or
provided for the scope of the change involved in the
program being designed and implemented. (See Figure
1, Berman, 1979.)

Using Berman's terms to summarize: they have not
provided a planned process of adaptive implementation
which would ensure a program that worked.

Figure 1

MATCHING IMPLEMENTATION TO SITUATION

                                 Implementation Approach
Situational Parameters           Programmed          Adaptive

1. Scope of change               minor               major

2. Certainty of technology       certain             uncertain
   or theories                   (within ri[...])    (high risk)

3. Conflict over goals           low                 high
   or means

4. Structure of the setting      tightly (coupled)   loosely

5. Stability of the              stable              unstable
   environment

(Berman, 1979)
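
For planners who wish to apply Figure 1 as a checklist, the matching can be sketched in a few lines of Python. This is only an illustration: the field names are invented here, and the decision rule (lean toward adaptive implementation when three or more of the five parameters point that way) is an assumption of this sketch, not a rule stated by Berman.

```python
from dataclasses import dataclass

@dataclass
class Situation:
    scope_of_change: str   # "minor" or "major"
    certainty: str         # "certain" or "uncertain"
    conflict: str          # "low" or "high"
    structure: str         # "tightly" or "loosely" (coupled)
    environment: str       # "stable" or "unstable"

# Values in the "Adaptive" column of Figure 1.
ADAPTIVE_VALUES = {
    "scope_of_change": "major",
    "certainty": "uncertain",
    "conflict": "high",
    "structure": "loosely",
    "environment": "unstable",
}

def adaptive_signals(situation):
    """Return the parameters whose value falls in the Adaptive column."""
    return [name for name, value in ADAPTIVE_VALUES.items()
            if getattr(situation, name) == value]

def suggest_approach(situation, threshold=3):
    """Assumed rule: 'adaptive' when at least `threshold` parameters say so."""
    if len(adaptive_signals(situation)) >= threshold:
        return "adaptive"
    return "programmed"

# Hypothetical district: a major change, uncertain theory, high conflict.
district = Situation("major", "uncertain", "high", "loosely", "stable")
print(adaptive_signals(district))
print(suggest_approach(district))
```

The threshold is deliberately exposed as a parameter, since a core planning group may decide that a single parameter (for example, high conflict) is enough to call for an adaptive approach.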

In too many cases, the programs that did not work were
designed by one or two persons in isolation, formulated
under time pressures, implemented in a segment of a
school system or a part of a school building, given an
identity that confused or irritated staff and assigned to
persons unfamiliar with the total school environment
and the system's environment in the community. There
were no provisions made for flexible adaptation to
changing theory, technology, values, opinions,
resources or methods. If you change your perspective
on what was just stated, you will recognize that what
was intended was an efficient, carefully planned and
precisely implemented program. But because it did not
take into account all of the previously mentioned
variables, it became an imposed, resisted, attacked
invader in the school system.

Since 1968, we have seen mounting conflict centered
around education for language minority children. We
have seen bilingual education attacked, resisted and
condemned at all levels of both education and society,
even though not in all situations. In 1968, few persons
were yet aware of the scope of the change being
asked/demanded of the educational institutions in the
United States of America. There was no agreement
then, nor is there today [...] the character of language
acquisition or assessment [...] the methods and
instruments to be [...] is, of course, tied
to many deep cultural [...] differences and
to the power issues [...] the allocation of
educational resources [...] however, is only
tangential to the pre[...]



Figure 2

(Kiefer, after Berman)


We are only now beginning to realize how loosely
school systems are coupled or tied to each other. We,
therefore, do not have formulas for operating with such
systems, nor do we have an understanding of all of the
implications of that kind of organizational pattern. We
are only now learning management strategies to deal
with the instability produced by changes in the
environment where schools operate.

The remainder of this paper will describe a process of
design and implementation that takes the above
concepts and uses them as a base for planning and
organizing language assessment programs.



PLANNING FOR CHANGE

If we want ours to be one of the language assessment
programs that work, then we have a complex change
process ahead of us. First, we will have to acknowledge
all of the variables involved. We cannot simply choose a
test and tell teachers to give it. Second, we will need to
collect information on the environment in which the
program will be implemented and the persons in that
environment. Third, we will need both courage and
patience to live with uncertainty, keeping the assurance
that we will become more certain. Uncertainty and
ambiguity are both forces which can paralyze us into
inaction and/or force us into arbitrary pronouncements
that we will regret.



So there are certain things that we can do to
implement language assessment programs that work.
These "things" are related to the following questions
and are part of planning change. They establish the
environment and start the planning.

1. Do we know the present status of assessment
in the system?

2. Do we know who directs assessment and what
[...] assessment theory?
What instruments are used?

3. Do we know how decisions on assessment are
made?
(A person? a committee? with or without
recommendations?)

4. Is language proficiency assessment a major or
a minor change in this assessment program?
Is there a form of language assessment being
used for another program? (Speech and
Language Program? Special Education?
Reading? Foreign Language?)
How can we relate to and [...]
district program in our own planning?



5. Who is involved in language teaching/ 
assessment for other programs? 

6. What theories of language learning are
generally accepted?

7. How familiar are staff members with these
theories? With kinds of tests? With aspects of
language that are assessed? With using test
data?

8. How familiar are staff members with language
and reading theories?



9. How much conflict is there among these
"language people" about the theories of 
language? 

10. What are the various segments or units of the 
school or system which would be involved with 
the language assessment procedures? 

11. How are these units related and how do they 
communicate? 

12. Is there a central committee to which new 
programs need to be submitted? 

13. What projections have been made on students,
languages, systems growth or decline, etc.? In
other words, what other changes are in process
or being projected?

14. What resources are available? How much
money can be spent for tests? For training
personnel to administer them? For scoring,
grading, reporting, and using the results of
tests?



It is possible for one person to sit alone in an office and
work out answers to these questions, then seek
information on the areas which are unclear. It is also
possible and much more effective for a core group to
meet and analyze these questions, clarify and discover
common understanding of these concepts and then
proceed to the next step: formulating a description of
the situation with a clear policy statement that directs
program development efforts.

Why wait until a program is in operation to discover
that you have a different concept of assessment than I
do; that you perceive different purposes for an
assessment program than I perceive; that you know
more about the district assessment program than I do
and that neither of us has a clear picture of the staff
available to do the "assessing"? Even if you sit alone to
formulate a beginning to focus the discussion, you
need to verify your perceptions and check with
persons on their perceptions and concepts. A program
that works will have a core discussion to initiate the
process of program design and implementation.

In the second step, this group can formulate a list of 
further information needed as well as sources for that
information. The members of the group can establish 
the process and time line for collecting this information 
and fitting it into the total design process. 

It should be obvious that one necessity is time: time for
talking and clarifying, time for research, time for
analyzing, time for deciding and time for training. One
of the most effective ways to use time is to spend it at
the outset to make the process clear. We may not know
what we are going to do, but we really are clear on how
we are going to do it.



THE PLANNING PROCESS 

The key to effective implementation is a process that
allows for and encourages an adaptive process of
change. The first step in that process is designing the
process itself. The chart on the following page (Figure 3)
outlines the steps which should follow that initial
discussion. Each of the steps will be discussed and then
applied to a sample development of a language
assessment program. (It can be mentioned here that
this is a developmental process that in effect has no
end. A policy may develop which is the opposite of the
first [...] a step
toward implementation and refinement of the
original program.)

Change needs to be considered as an ongoing
developmental process of increasing precision and
refinement in policy and program.



Policy Development

A policy statement is a formulation of a consensus of 
the group on a particular response to a problem or 
condition. 

Example:

Problem: We do not know how nor when to initiate
reading instruction for language minority
students.

Policy: All incoming first grade language minority
students will be given an assessment of
oral language proficiency to determine
which language provides a skills base for
the reading process.

Figure 3

PROGRAM MANAGEMENT FOR CHANGE

WHAT                  WHO                          HOW                  (WHEN - WHERE)

1. Policy Statement   Total Staff                  Consensus
2. Program Design     Committee with Staff Input   Task Force
3. Implementation     Manager                      Following Design
4. Evaluation         Total Staff                  Following Design
5. Conclusions        Total Staff                  Consensus Decision
6. Revised Policy     Total Staff                  Consensus Decision



At this stage we acknowledge that we do not have the
experts who can tell us with certainty that we have
made a perfect choice of response. We cannot even be
sure that we have made the best choice. We know that
there is controversy over the relationship between oral
skills and literacy skills. We know there is no certainty
that we will not change the policy when we have
accumulated more information on the students and
more skill in assessing language and predicting reading
success. We can prevent sudden and arbitrary changes
of policy and program by establishing regular times for
evaluation. We can be sure that we have agreed upon a
course of action that seems to start us in the direction
of clear and careful planning for the instruction of
children. We also know that the policy will be reviewed
after a stated length of time and after carefully
collecting facts about what happens to the students.

The review will enable us to re-decide with more
precision. This is in no way an experiment. An
experiment is designed to see if something works or
how it works. What we are doing is acknowledging that
information on language is exploding along with all
other categories of information. It is no longer possible
to know all there is to know before we make a decision.
(One of the current problems is that language research
was not available to today's teachers at the time they
were being trained. Since 1950, and especially in the
last ten years, we have profited from many persons'
efforts to learn exactly what language is and how we
acquire it.)

A consensus, therefore, on what course will be followed 
is the first step in the process of seeing how a program 
that will enable us to teach/learn more effectively can 
be developed. We can be paralyzed in our efforts if we 
expect to know beforehand everything that we will
know after the program is implemented.

(A side note: we have to trust ourselves as professionals
to discover what the questions are and to seek from the
"experts" answers for what we know are our questions.)

Policy determination is best done after clarifying all the
concepts and coming to a consensus decision of the
persons involved in the program.



Design 

This process, however, seldom works for the next task.
The discussion and consensus-building, decision-making
process flounders when a group tries to design
the implementation and evaluation steps to see that a
policy becomes operative. To have a task force of about
five persons receive directions from the group (a list of
"watch out for," "please do," "don't," etc.) before
beginning the [...] efforts [...].

Back to the sample: a staff might give the following
tasks to the group:

Do - Plan for multiple language groups, not just
Spanish.

Do - Provide both pre- and post-tests.

Don't - Use norm-referenced tests.

Do - Provide inservice to all staff.

Do - Check with the Director of Pupil Personnel
Services for a representative to review the
design.

Do - Be finished by July 1.

etc.

This way the task force hears ahead of time what the
group will be looking for in the final review. (Few groups
seem to be able to identify all these points ahead of
time, but there will usually be a sufficient number of
"caveats" to provide direction for the task force.)
Projects that "don't work" find that the long hours of
designing a program can be wasted, and the staff
frustrated, if their project is severely changed by the
committee review. All members know that they need to
be flexible and respond to criticisms and suggestions,
but the "do's and don't's" can focus their time and
efforts to prevent some of the frustrations.

The sample again: A design has all the information:
who is going to do what to whom, when, where and
with what. It will list the personnel, the resources and
materials, the process, the persons involved, the dates,
the person or persons responsible, the locations, and
the questions which will direct the collection of
information for the evaluation. It will, most of all, be
aimed at providing a process to solve the original 
problem: Will this design for implementation and 
evaluation provide us with a way to decide on the time 
and the language in which to initiate reading 
instruction for language minority children? 



DATE       TIME       GROUP                        STAFF RESPONSIBLE

July 30    10:00 am   Inservice Staff              Program Manager
Aug. 15    10:00 am   Inservice Testers            Program Manager
Sept. 2    2:00 pm    Awareness Session            Manager & Staff
Sept. 10   9:00 am    Begin speech and             All Secondary Staff
                      language screening.
Sept. 20              Evaluation report on
                      screening: how many
                      screened, how many
                      identified as needing
                      testing, etc.
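
A time line like the sample above is, in effect, a small table of who does what and when. As a minimal sketch (Python is used only for illustration; the `PlanStep` record and its field names are hypothetical, and the pairing of staff to rows is an assumption read off the sample), the plan can be held as data and checked for steps that still have no one responsible:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PlanStep:
    """One row of a time line/action plan (field names are illustrative)."""
    date: str
    activity: str
    responsible: Optional[str] = None  # None means nobody is assigned yet
    time: Optional[str] = None


def unassigned(steps):
    """Return the steps that still lack a responsible person or group."""
    return [s for s in steps if not s.responsible]


# Rows adapted from the sample time line; the staff-to-row pairing is assumed.
plan = [
    PlanStep("July 30", "Inservice staff", "Program Manager", "10:00 am"),
    PlanStep("Aug. 15", "Inservice testers", "Program Manager", "10:00 am"),
    PlanStep("Sept. 2", "Awareness session", "Manager & Staff", "2:00 pm"),
    PlanStep("Sept. 10", "Begin speech and language screening",
             "All Secondary Staff", "9:00 am"),
    PlanStep("Sept. 20", "Evaluation report on screening"),
]

for step in unassigned(plan):
    print(f"{step.date}: '{step.activity}' has no one responsible yet")
```

A check of this kind is one way a task force might anticipate gaps before the design goes back to the staff for finalizing.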

The complete time line/action plan will lead to a
formulation of all the resources, both personnel and
materials, that will be needed to implement and
evaluate the program. The design, therefore, needs to
anticipate problems in all aspects of implementation
and evaluation. When the design is completed to the
satisfaction of the task force, following the
specifications of the staff, it goes back to that staff for
finalizing, then to the program manager to begin to
make it happen.

Implementation and Evaluation



As a transition to implementation, one
manager told me that he [...] use the exact language
of the design and policy for training, for awareness
sessions, and even for the necessary reports. Thus,
feedback could be coded to certain sections of the
design and used for both formative and summative
evaluation of the program.

Remembering the sample again: We aim the
implementation at the original problem: decisions need
to be made on initial reading instruction. Our program
of evaluation, our choice of test instruments, our
training of the testers, our analysis of the test results: all
should provide us with facts that can clarify the
decision on reading. Implementation (operationalizing
and institutionalizing the design) also operationalizes
the evaluation process. For even though implementation
is listed as step three and evaluation as step four,
they are concurrent processes. In addition, they are
both now a matter of taking the design and seeing that
it happens.

Our evaluation will begin with the question: Do we now 
have sufficient data for decisions? If the answer is yes, 
continue and refine the process. If no, why not? What
have we learned about the instruments, the aides giving
tests, the reading series, the opinions, values and
attitudes of the personnel that will enable us to refine
the original policy or the design? The sample again: we 
might find that: 



Those monolingual in another language may 
not have a sufficient language base for reading 
in either language. 

The tests we have chosen do not provide what
we need.

Some native speakers of English do not have a
sufficient language base for reading.

We did not sufficiently train those who
administered the tests.



In other words, we will be able to pinpoint the reasons
why we still need more information on students to
adapt the design or the policy. We now have an
alternative to scrapping the procedure and either
returning to the former arbitrary decisions or
floundering, with random attempts, to discover "what
went wrong."



We may discover that the policy is correct and
implementation is effective, or that the implementation
went well but the evaluation information was not
systematically collected, recorded and analyzed. In
short, we should [...] in a useable
format to determine [...] to the results of the
evaluation.



Summary

To make programs work, we need
to anticipate,
explore,
investigate,
evaluate,
and be flexible in all the processes without being
arbitrary and/or casual in our decisions. We should
have a program that works in our own terms: we know
why we are doing what we are doing; how we are doing
it; who did or did not do what; and the opinions and
reactions of those involved.



We will be able to describe:

the scope of the change we projected and the
change we accomplished;
the amount of [conflict] over goals and means;
the organizati[onal type] of setting in which
the program ex[ists];
the stability of th[e environment];
the certainty/uncertainty of the theories and
technology involved.



This collection of papers undoubtedly adds to your general
understanding of the state of those theories, as
uncertain to say the least, and conflicting to say a little
more. Perhaps knowing how we will work can provide a
compensation for the uncertainty involved in the exact
content of what we need to do. Perhaps a definite, but 
flexible, change process can direct our efforts to 
determine and refine what needs doing. 



APPENDIX

Appended to this article are two drafts of the results of
the process of planning. Patricia Chamberlain of the
Illinois Resource Center in Arlington Heights shared
what is being developed by a staff in a local district. The
charts are examples of the increasing precision that can
be obtained in describing exactly what will be done and
how. The samples provide a demonstration of how the
group understanding grows and how the myriads of
details involved in a program begin to organize
themselves in a way that supports and encourages
implementation.

There may be a third draft and a fourth draft, as well as
the knowledge that, subject to both formative and
summative evaluation, the "final form" need never and
may never be developed.






APPENDIX 
DRAFT #1 
July, 1982 

ENTRY/EXIT PROCEDURE 



POLICY 



Students will be identified 
by the Home Language 
Committee 



DESIGN 

All entering students
will complete the Home
Language Survey form



IMPLEMENTATION 



EVALUATION 



Students that answer a
language other than English
to questions 1, 2 or 3
will be screened further
via testing and observation



Children should be given
an English language
proficiency test



Children should be
given an achievement
test in English



Placement decisions will 
be made by a building 
committee. 



All children in bilingual 
programs should be 
tested in . native 
language. 



All preschool achievement
data will be reported
to the receiving teacher.



Children will be given
the Language Assessment
Scales in English (1-8).

Children (4-6) will be
given either the LAS or
a language-sample rating
system.



Children will take the
LABS.

Children will take the
SAT (where possible).



The Committee will consist
of the bilingual teacher,
principal, the receiving 
teacher, parents and other 
informed participants. 



Children (1-8) will be 
tested using the Language 
Assessment Scales in 
Spanish, if appropriate. 

— All children will be 
tested using the 
California Test of 
Basic Skills (CTBS) 
Spanish. 



Students must achieve a
level 4 proficiency to exit
the bilingual program



Students must achieve
the following on English
achievement tests to
exit the bilingual program:
70% or above on LABS or
2/3 at or above district
norms on the SAT



Placement may include
bilingual program, all-English
program, ESL only, Title I,
Special Education referrals,
others.



Children will be tested
in the spring of each
academic year.



A follow-up evaluation
will be conducted twice a
year as part of teacher/
principal meetings.



Children will not
be tested using the
Boehm Test of Basic
Concepts.

Children (1-3) will
be tested using the
Santillana Reading
tests.



Preschool students 
will be tested. 



POLICY 



DESIGN 



IMPLEMENTATION



EVALUATION



Lincoln, Paul Revere, 
Whittier, Greenwood, 
Greenbriar, and Kerr 
Schools will house 
bilingual programs. 

Pull-out: Bilingual
teachers will receive
an aide if they
teach more than 3 grade
levels.



Lincoln will adopt an integrated
model. All other
schools will adopt a pull-out
model.

A job description for
bilingual teachers will
be written.



(For pull-out programs)
Schedules will be developed
cooperatively by the
bilingual teacher,
classroom teachers, the
building principal and
the administrator of
bilingual program.

Teachers in both the
integrated and the pull-out
models will have planning
time as designated by the
Professional Negotiations
Agreement.

Principal may provide up to
60 minutes per week release
time for pull-out teachers for
testing, conferencing and
translating.



Level 1 & 2 bilingual
students (1-8) will
receive 150 minutes
per day of instructional
time in the bilingual
program.

This includes native
language in reading,
language arts, science,
social studies, math,
and ESL instruction.

The Santillana Reading in
Two Languages Series
in Spanish will be adopted for
grades K-4 for reading.

The Santillana EGB Senda
Series will be adopted for
grades 5-8 for reading and
language arts.

Other eligible students
will receive a minimum
of 90 minutes per day
of instructional time
in the bilingual program.

Content areas will be
taught in native language
based on individual
needs.

The selection of supplementary
materials in science
and social studies will be
made at the fall bilingual
monthly staff meeting.

Addison-Wesley's Math
in Our World will be
adopted in Spanish for
grades 1-8 for math.

Scott Foresman I Like
English will be adopted
for ESL instruction
in grades K-5.

The selection of supplementary
ESL materials for
grades K-8 will be made
during the monthly bilingual
meeting.

Additional money should
be made available for
language experience supplies
and activities
for grades K-8.

Formal reading instruction
will be given
in the student's
stronger language
cognitive.



POLICY



DESIGN 



New bilingual staff
will be trained to work
with the limited-English-proficient
students.



The bilingual staff will
meet once a month during
the school year.



A basic skills curriculum
in Spanish language arts
will be developed.



Training will be provided
in the following areas:
bilingual/bicultural
education, district
and state policies
and procedures, and
district bilingual curriculum.

The bilingual administrator,
bilingual staff and principals
of bilingual buildings
will be invited.



An ESL curriculum will be
developed.

Report cards will be revised.



Two sections will be added
to the existing report card:
Spanish language arts and ESL



IMPLEMENTATION 



EVALUATION 



Training will take place
two days prior to the 
opening of the school. 



October 4 Institute Day will 
be designated as a work ses- 
sion for bilingual teachers 
to begin curriculum develop- 
ment. 



POLICY 



DESIGN 



NONBILINGUAL PERSONNEL 

IMPLEMENTATION 



Nonbilingual personnel
will be provided with resources
and information regarding
the bilingual program
and student.



Materials for LEP students
to use in the nonbilingual
classroom will be made
available to all staff members.



Administrators and nonbilingual
staff will be provided
inservice on bilingual
policies and procedures as
soon as possible during the
82-83 school year.



Two copies of Handbook for
Classroom Teachers of LEP 
Students will be dissemina- 
ted to each building with 
LEP students by the project 
administrator. 

The selection of resource 
materials will be made at 
the fall bilingual staff 
meeting. The materials 
will be housed in the 
Learning Media Centers 
at each building.



Bilingual materials available
in individual buildings
will be combined and housed
in the LMC.



EVALUATION 



APPENDIX
DRAFT #2
September, 1982



POLICY



DESIGN 



IMPLEMENTATION 



EVALUATION 



Lincoln, Paul Revere,
Whittier, Greenwood, Greenbriar
and Kerr Schools will
house bilingual programs.

Lincoln will adopt an integrated
model. All other
schools will adopt a pull-out
model.

Pull-out:

Bilingual teachers will receive
an aide if they teach
more than three grade levels.

A job description for bilin- 
gual teachers will be written. 



(For pull-out programs)
Teachers in both the integrated
and the pull-out
models will have planning
time as designated by the
Professional Negotiations
Agreement. The bilingual
program administrator will
monitor compliance.

Schedules will be developed
cooperatively by the bilingual
teacher, classroom
teachers, the building principal
and the administrator
of the bilingual program by
(time).



Formal reading instruction
will be given in the
student's stronger language
cognitive.



Students will receive first
language instruction based
on need in order to provide
for cognitive growth.



Principal may provide up to
60 minutes per week release
time for pull-out teachers
for testing, conferencing
and translating.



Level 1 & 2 bilingual
students (grades 1-8) will
receive 150 minutes per day
of instructional time in the
bilingual program.



Instruction includes native 
language reading, language 
arts, science, social 
studies, math, and ESL. 



Other eligible students will
receive a minimum of 90
minutes per day of instructional
time in the bilingual
program.



The Santillana Reading in
Two Languages Series in
Spanish will be adopted for
grades K-4 for reading.



The Santillana EGB Senda
Series will be adopted for
grades 5-8 for reading and
language arts.



The selection of supplementary
materials in science and
social studies will be made
at the fall bilingual monthly
staff meeting.

Addison-Wesley's Math in
Our World will be adopted
in Spanish for grades 4-8
for math.



Scott Foresman I Like
English will be adopted for
ESL instruction in grades
K-5.

The selection of supplementary
ESL materials for
grades K-8 will be made
during the monthly bilingual
meeting.

Additional money should be
made available for language
experience supplies
and activities for grades
K-8.






New bilingual staff will be
trained to work with the
limited-English-proficient
students.



Training will be provided in
the following areas: bilingual/bicultural
education,
district and state policies
and procedures, and district
bilingual curriculum.



IMPLEMENTATION

Training will take place
two days prior to the opening
of the school and be
provided by (person).



Report cards will reflect
the bilingual curriculum.

Two sections will be added
to the existing report card:
Spanish language arts and
ESL.



The additions will be made
by (person) by (time).



STEPS TO DEVELOP AN ESL CURRICULUM 



1. Clearly define levels:
Example: Level 1



1. Understand very little spoken English, but can distinguish some words/phrases.

2. Can label certain concepts; i.e., book, door, etc.



2. Assign percentage of time devoted to four skill areas by level.
Example

Skill areas: Listening, Speaking, Reading, Writing

Level I (Beg. - End): 90% - 50%; 10% - 50%; 10% - 30%
Level II (Beg. - End): 70% - 50%; 30% - 50%; 15% - 35%; 10% - 25%
Level III (Beg. - End): 60% - 40%; 40% - 60%
Level IV (Beg. - End): 40% - 20%; 40% - 60%
Level V: 20%; 40%; 25% - 45%; 15% - 30%; 20% - 40%

Levels I and II should spend more time on listening and speaking.
Levels III and IV should spend a little more time on reading and writing.

3. Brainstorm for topics (survival) and then think of one language experience activity for each topic.
4. Brainstorm and assign topics to different levels - ensure that the topics are spiraled throughout levels (when
appropriate).



For example: Level I:



Greetings and Leave Taking Language Experience - have kids line up outside of
room and walk in one at a time. Each one should say hi, hello, how are you, etc.



5. Go back, brainstorm and think of appropriate language that fits the situations (topics) and functions.

NONBILINGUAL PERSONNEL



POLICY 


DESIGN 


IMPLEMENTATION EVALUATION 








Nonbilingual personnel
will be provided with resources
and information regarding
the bilingual program
and students.


Resource materials will be
available in every bilingual
building.


Two copies of Handbook for
Classroom Teachers of LEP
Students will be disseminated
to each building with
LEP students by the project
administrator by (date).










Materials for LEP students
to use in the nonbilingual
classroom will be made
available to all staff members.


The selection of resource
materials will be made at
the fall bilingual staff
meeting. The materials will
be housed in the Learning
Media Centers at each building.
(Person) will be responsible
for implementation.


Bilingual materials currently
available in individual
buildings will be combined
by (person) and housed in
the LMC by (time).










Administrators and nonbilingual
staff will be provided
inservice on bilingual
policies and procedures as
soon as possible during the
82-83 school year.


A committee of staff representatives
will be
formed to review the bilingual
program policies,
procedures, design and implementation.


The committee should have
representatives from the
bilingual program, the nonbilingual
staff, the administration
and the preschool
program.


The committee will meet one
week in June. It will reconvene
in November and May.

A bilingual program manual
will be developed by September
of 1982 by (people).








Parents are critical to
students' school success.


The Parent Advisory Council
will meet publicly a minimum
of three times per year.


District personnel as well
as parents should be informed
of meeting dates.








Bilingual records will be
accessible to authorized
personnel.


Bilingual Census data will
be housed in each bilingual
building.


The bilingual teachers will
keep the information. Data
will be collected (time) by
(person).


A bilingual student information
sheet will be maintained
in the temporary
cumulative record file for
each student considered for
or placed in the bilingual
program. (See attached
sheet.)


Each teacher will maintain
up-to-date records on the
students in his/her class.
The program administrator
will monitor compliance.






A part-time bilingual project
director should be
assigned/hired for supervision
of the program.


Parents should be inserviced
on a regular basis.


Topics for parent education
should include: Bilingual
Education in Two Languages,
The District Bilingual Program,
and How to Help Your
Children at Home.


Meetings should be held
quarterly.



School District #130

Bilingual Student Information

(to be filed in student's temporary record)

Name:
Birthdate:
Address:
Language Survey:
Parent:
Teacher:
Kindergarten Assessment/Evaluation Information:
Referral date:
Phone:
Date:
Date:

YEAR
GRADE

SAT Total Reading/GE
SAT Total Math/GE
SAT Total Auditory/GE
LABS (%C/M) or
LABS Survey test %

English Language Proficiency Test
Test:
Date:

Reading
Test:
Score:

Math
Test:
GE/Score:



Assessment Committee Conferences

Date:
In Attendance:
Recommendations:

Date:
In Attendance:
Recommendations:

Date:
In Attendance:
Recommendations:



POLICY



DESIGN 



ENTRY/ EXIT PROCEDURE 

IMPLEMENTATION 



EVALUATION 



Student's home language
will be identified upon
entry.



All students will complete
the Home Language Survey
(HLS) form upon entrance.



Students should be given
tests in English to determine
their level of
functioning in an all-English
program.



Students should be given an
English language proficiency
test.

Students will be given the
Language Assessment Scales
in English (grades 1-8).

Students (ages 4-6) will be
given either the LAS or a
language sample rating system.



HLS will be reviewed by
(person). The (person) will
assure the completion of the
form by (date).

Students that answer a language
other than English to
questions 1, 2, or 3 will be
screened further via testing
and observation within (time).

Students must achieve a level
4 proficiency to exit the bilingual
program.

Level 4 proficiency alone is
not sufficient for program
exit. Students will be
tested by (person) within
(time).



Students should be given an 
achievement test in English. 

Students will take the LABS.

Students will take the SAT
where possible.



The tests will be administered
by (person) on (date).

Students must achieve the
following on English achievement
tests to exit the bilingual
program:
70% or above on LABS or
2/3 at or above district
norms on the SAT.



A follow-up evaluation will
be conducted twice a year as
part of teacher/principal
meetings. Is the student
functioning at or above grade
level in English?



A committee should be formed 
to review all placements to 
ensure educational appro- 
priateness. 



Placement decisions will be 
made hy a buifding committee, 



Test data -wilj be reviewed by 
the commit tee within (time). 

Thfe committee will consist of 
the bilinguaf teacher, the N 
^teceivinS teacher, principal, 
' Uparents and <5ther informed 
participants. 

Placement -may include bi- 
lingual progranry all-English 
program. ESL only, Title I, 
Special^Education referrals, 
others. 

A . 

« The committee will meet 
(when). " * 



All students in bilingual programs should be tested in native
language in order to have diagnostic information regarding
conceptual development.

Students (grades [illegible]-8) will be tested using the Language
Assessment Scales in Spanish, if appropriate.

All students will be tested using the California Test of Basic
Skills (CTBS) in Spanish.

Students will not be tested using the Boehm Test of Basic Concepts.

Students (grades 1-3) will be tested using the Santillana Reading
tests.

Bilingual teachers will test program students in the spring of each
academic school year.

New students will be tested by (person) within (time) of entry.



Preschool students will be tested using the (test) (when).

All preschool achievement data will be reported to the receiving
teacher by the sending teacher (at the end of the school year).








The preschool teacher will [illegible] record the data on the
student information sheet (when).
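
The exit rules stated in the procedure above can be read as a simple decision rule. The following is a minimal sketch in Python, offered only as an illustration and not as part of the original procedure. The names `ExitRecord` and `meets_exit_criteria` are hypothetical. Two assumptions are made that the procedure does not spell out: "level 4 proficiency" is treated as level 4 or higher, and "2/3 at or above district norms on the SAT" is read as at least two thirds of the SAT subtests taken being at or above the district norm.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Optional, Sequence


@dataclass
class ExitRecord:
    """Hypothetical record of one student's English test results."""
    las_level: int                          # English proficiency level on the LAS
    labs_percent: Optional[float] = None    # percent score on the LABS, if taken
    # One entry per SAT subtest taken: True if at or above the district norm.
    sat_at_or_above_norm: Optional[Sequence[bool]] = None


def meets_exit_criteria(record: ExitRecord) -> bool:
    """Return True if the record satisfies both exit conditions.

    Proficiency alone is not sufficient: the student also needs
    70% or above on the LABS, or (assumed reading) at least 2/3 of
    SAT subtests at or above district norms.
    """
    if record.las_level < 4:  # assumption: level 4 or higher qualifies
        return False

    labs_ok = record.labs_percent is not None and record.labs_percent >= 70

    sat_ok = False
    if record.sat_at_or_above_norm:
        passed = sum(record.sat_at_or_above_norm)
        total = len(record.sat_at_or_above_norm)
        sat_ok = Fraction(passed, total) >= Fraction(2, 3)

    return labs_ok or sat_ok
```

Under these assumptions, a student at level 4 with no achievement score would stay in the program, which matches the statement that level 4 proficiency alone is not sufficient for exit. The final placement decision still rests with the building committee, which the sketch does not model.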








NUMBERS AND NEEDS: ESTIMATING BILINGUAL EDUCATION POPULATIONS



Dorothy Waggoner, Ph.D. 
Waggoner and Associates
Washington, D.C. 



Introduction 



A variety of estimates of the numbers of language minority children
who need or might benefit from bilingual education programs has been
put forward in the years since the first federal bilingual education
legislation was passed in 1968. These estimates have varying sources
and varying degrees of reliability. They describe populations defined
in various ways. They respond to different concepts of the benefits of
bilingual education. This paper discusses some of these estimates. It
describes, in particular, the issues underlying the 3.6 million
estimate resulting from the 1978 Children's English and Services Study
(CESS), the minimum of 3.4 million based upon findings from the CESS
applied to the 1980 Census results (both responding to the definition
in the Bilingual Education Act), and the less than a million to 1.5
million estimate of the Department of Education's Office of Planning,
Budget and Evaluation (OPBE), originally developed in connection with
the Department's Proposed Regulations for compliance with national
origin nondiscrimination (Lau compliance). It sets forth the
implications of the published estimates on language minority children
from the 1980 Census and the relationship of those estimates to the
limited-English-proficient children defined in the Bilingual Education
Act. It affirms the need to support the present definition of the
target group in the Bilingual Education Act so that all
limited-English-proficient children may continue to be served and so
that the advantages of integrated programs and of bilingualism for
minority and majority children in the wider society may be realized.

There were in 1960 about five million persons of school age (6-18)
in the United States who had a non-English mother tongue. It is
reliably estimated that over three million of this group did in fact
retain the use of that tongue . . . The situation is not known to
have changed notably since 1960. These are the children we are
concerned with, plus another million or so in the same category
under six years of age and soon to enter the schools. They are
necessarily and unavoidably bilingual children.

Bruce Gaarder, U.S. Office of Education, testifying before Senator
Yarborough's Special Subcommittee on Bilingual Education, May 18,
1967 (Andersson and Boyer 1970: I, 49-50)

According to the 1970 Census, about five million youngsters in this
country come from homes in which the language generally spoken is
something other than English. A number of these youngsters . . .
speak English, of course, but estimates based on recent samplings in
several States are that between 1.8 and 2.5 million others need
bilingual education.

Article in American Education, July 1974 (U.S. Office of Education,
DHEW)

An estimated 2.4 million children with limited-English language
proficiency aged 5 to 14 were living in the United States in the
spring of 1978. This number represents 63 percent of all children
aged 5 to 14 years living in households where a language other than
English was spoken. In addition, there were estimated to be as many
as 1.2 million children of limited-English proficiency who were
younger or older than the study group but who were also of school
age.

Josue Gonzalez announcing the results of the Children's English and
Services Study at the 1979 NABE conference (NABE News, June 1979: 1)


In these proposed rules, the Education Department addresses one of
the most serious barriers to equal opportunity in education. It
proposes standards for teaching students whose primary language is
not English and who have limited proficiency in English. The number
of these students is large and growing. Estimates by the National
Institute of Education and the National Center for Education
Statistics place the number of limited-English-proficient school-age
children at over three and a half million.

Proposed Lau Regulations, U.S. Department of Education, August 5,
1980 (Federal Register: 52052-53)


We believe that the number of children whose opportunities to
benefit from education are curtailed by dependence on a non-English
language is almost certainly not more than 1.5 million and possibly
less than 1 million.

Robert E. Barnes, Office of Planning, Budget, and Evaluation,
Department of Education, in the draft final report "Size of the
Eligible Language Minority Population" (September 25, 1981)

In 1980, one out of every ten persons (5 years old and over)
reported speaking a language other than English at home . . . Well
over one half million school-age children . . . who speak a foreign
language at home characterized their ability to speak English as
"not well" or "not at all."
Bureau of the Census [remainder of attribution illegible]

[Opening of quotation illegible] . . . Education Act . . . Educators
[illegible] argue that students whose proficiency is limited in both
their native language and English would not be effectively served by
native language instruction.

Article in Education Week, June [day illegible], 1982 (Foster and
Matzko 1982: [page illegible])

Back of these differences in estimates are a number of issues and
implications. There is the issue of the definition of the group whose
size is being estimated. There is the issue of who needs special
programs, such as bilingual education, and who can benefit from them.
There are sociopolitical implications in the size of the estimates.
The smaller the size of the group which must be served, is "eligible"
for services, or can benefit from them, the lower the potential cost
to the Federal Government, the states, or the school districts. The
smaller the number, the more difficult it will be for concerned
minorities to obtain a hearing. The smaller the number, the easier for
opponents to dismiss the need for special programs and to comfort
themselves that it will eventually go away.


The issue of the definition of the group whose size is being
estimated developed in connection with the issuance of the Proposed
Lau Regulations by the Department of Education on August 5, 1980. The
original Bilingual Education Act (Title VII of the Elementary and
Secondary Education Act of 1965 as amended in 1968) defined the target
group as children of limited-English-speaking ability "who come from
environments where the dominant language is other than English"
(Andersson and Boyer 1970: II, 1). The 1974 and 1978 amendments to the
Act changed and refined the definition of the target group and
modified the terminology. These amendments recognized that not all
language minority children have difficulty with English, an
implication in the original definition. Both specified that "the
language normally used by the parents of the child" should be
considered the native language of the child. Neither placed any
restriction on inclusion in the target group of the
limited-English-speaking (1974) or limited-English-proficient (1978)
on the basis of the child's language usage. The 1978 amendments
further specified that, in the case of Native American children, the
Indian language need only have had a "significant impact on their
level of English language proficiency" (Bilingual Education Act, Sec.
703(a)(1) and (2)). It did not even have to be dominant in their
current environment.

The estimate of 2.4 million children aged 5 to 14, or 3.6 million
total of school age, represents the group which meets the Bilingual
Education Act definition as presently enacted. This number comes from
the Children's English and Services Study. It was produced in response
to the Congressional mandate to count limited-English-proficient
children and adults in the Bilingual Education Act [illegible]. The
CESS estimates were the basis [illegible]. As [illegible] in the
Congress last year, [illegible] these estimates were and are still the
only available estimates based upon a national sample of language
minority children who were tested in their homes with an objective
measure of school-related English skills especially developed for this
purpose. These estimates will be superseded later this year by the
findings from a new study which will be discussed at the conclusion of
this article.

The Proposed Lau Regulations, which were issued by the Department of
Education on August 5, 1980, used the 3.6 million estimate from the
CESS. However, they employed a complicated set of definitions of the
children for whom school districts would be expected to provide
services to comply with national origin nondiscrimination as upheld by
the Supreme Court in Lau v. Nichols. In the Proposed Rules, children
with a primary language other than English who were
limited-English-proficient (LEP) were to be classified according to
their relative proficiency in English and their primary language.
Depending upon their relative proficiency, they were to be classified
as English-superior, comparably limited, or primary-language-superior.
The primary language was defined as the first language learned or the
language normally used by the student. Different treatments were to be
based upon these classifications, including the right of equal access
to compensatory programs for LEP students classified as
"English-superior" (Proposed Lau Regulations, 1980, pars. 100.33 and
100.39). The CESS findings did not address these classifications
except indirectly. The Bilingual Education Act does not contain them.

The Regulatory Analysis Review Group (RARG) of the Council on Wage
and Price Stability of the Executive Office of the President has the
authority under legislation passed in 1974 to intervene in
governmental rulemaking. It did so in the case of the Proposed Lau
Regulations shortly after they were issued. It explained that it took
this action because these were the first important rules (regulations)
to be issued by the Department (of Education) since it was established
(U.S. Council 1980). To meet one of the requirements of the RARG,
personnel in the Department of Education's Office of Planning, Budget,
and Evaluation (OPBE) began attempting to estimate the numbers of
students who would be affected by the Proposed Regulations according
to the relative proficiency classifications. Although all LEP
children, regardless of relative proficiency classification, were
covered by the Proposed Rules, OPBE determined to reduce the numbers
to those who might be classified as primary-language-superior. The
estimate of less than a million to no more than 1.5 million children
comes from the OPBE draft final report, "Size of the Eligible Language
Minority Population." This report was issued with a disclaimer that it
does not represent the official position of the U.S. Department of
Education. It was issued a year after the RARG's review of the
Proposed Regulations and six months after the inauguration of the
Reagan Administration and the withdrawal of the regulations which
triggered the effort. This latter point is not unimportant. Having
embarked upon revising the results of the Department's effort to meet
the Bilingual Education Act mandate, although for Lau compliance
purposes, OPBE appeared to be reluctant to discontinue the effort even
when the reason for it had disappeared. The estimate was, accordingly,
proposed to be an estimate of the Bilingual Education Act target
group. The latter was redefined to consist only of those considered to
be "eligible" for bilingual services or "entitled" to receive them by
virtue of "dependence on a non-English language" (Barnes 1981). The
Bilingual Education Act program is not an entitlement program.
Instead, school districts apply for grants to serve LEP children they
believe will benefit from them. The Department's General Counsel
disallowed OPBE's limitation of the target group to those in some
sense dependent on a non-English language. The GC pointed out that
dependence on a language other than English is not a criterion for
limited-English proficiency as defined in the Act. The GC's opinion
led, in turn, to the Administration's proposed amendment to the Act to
give a funding priority to projects which would serve LEP children
whose usual language is other than English.

Back of this determination to restrict participation in bilingual
education programs, with resulting reduction in the numbers, is a
pedagogical issue concerning the benefits to be derived from these
programs. Consider these quotations:

Since these limited-English proficient students' stronger language
is English (the English-superior group), it follows that they would
suffer less of an academic handicap if the academic curriculum were
taught in English, rather than in their home language.

. . . We would recommend that equally limited bilinguals who speak
English at home would best be instructed in English, while those who
speak another language at home might benefit more from instruction
through that language. (Dulay and Burt, 1980: 16, 19)

It is our contention that a child who is not dependent on a language
other than English, regardless of the child's limitations in English,
should not require and would not benefit from services provided in the
non-English language. (Barnes, 1981: 17)

The research evidence . . . strongly suggests that programs that aim
to develop a high level of proficiency in two languages provide
greater potential for academic development for all children than
education through the medium of only one language . . . Research has
failed to identify any category of student for whom a bilingual
education would be less suitable than a monolingual education . . .
(There is) no educational support, either empirical or theoretical,
(for) limiting access to bilingual education by
Limited-English-Proficient (LEP) students (on the basis of
"English-superiority" or use of English at home.) (Cummins, 1981: 42)

Bilingual education in the United States, as in many countries in
the world, was originally justified by citing the 1953 UNESCO axiom
that the best medium for teaching children is their mother tongue.
Reinterpreted as meaning that children should be taught in their
stronger language, this theory leads directly to the notions of
"English-superior," "primary-language-superior," etc., with different
treatments for each, embodied in the Proposed Lau Rules. It is
exemplified in the quotations from Dulay and Burt and from the OPBE
report cited above. It is not, however, embodied in the Bilingual
Education Act definition which, as indicated, emphasizes the
limited-English proficiency of language-minority children as the
crucial characteristic. As presently enacted, the legislation enables
school districts to apply for funds to develop and implement bilingual
programs for all LEP children whom they believe will benefit from
them. As presently enacted, the legislation permits school districts
to apply for funds to develop and implement programs in which as many
as forty percent of the participating children are English-speaking,
majority children. If, as affirmed at the 1983 National Association
for Bilingual Education conference, bilingualism is in the national
interest, then these aspects of the present Bilingual Education Act
must be defended. All LEP children who can benefit from bilingual
programs must continue to be able to participate in them. Majority
children must continue to be able to participate up to the limit of
forty percent. Only in this way will the advantages which follow from
integrated programs for minority and majority children and for the
wider society be realized.


The estimates from the 1980 Census published by the Bureau of the
Census have the most serious sociopolitical implications because of
the danger of misinterpretation and because Census data have a way of
becoming the final word. The national number, about 650,000, and
equivalent estimates for states and smaller geographic units represent
children, aged 5 to 17, reported to speak languages other than English
at home and not to speak English well or not to speak it at all. The
650,000 is a provisional estimate from a sample of the 1980 Census
sample which was published last spring. The estimate from the whole
sample will eventually be published. Final estimates from the entire
sample are already being published for the states. In the CESS, there
was a considerable difference between the ratings of children's
English-speaking skills by their parents or guardians and objective
testing of the school-related English skills, including reading and
writing, of those same children. Applying the
limited-English-proficiency (LEP) rates from the CESS to the total of
4.5 million school-age children who were reported in the Census to
speak non-English languages at home yields about 2.9 million children
with limited-English proficiency as defined in the Bilingual Education
Act. However, these are only the LEP children reported to speak
non-English languages at home. Language minority children who speak
English at home were not included. Information on their
English-speaking skills was not gathered in the Census. Again applying
rates from the CESS, it is estimated that there are at least 544,000
other language-minority children who only speak English, but who lack
sufficient proficiency in the school-related English skills to succeed
without special help in the English-medium school system. Thus, a
minimum of 3.4 million children in the United States in 1980 meet the
definition in the Bilingual Education Act. A minimum of 3.4 million
children have insufficient English-language skills to succeed without
assistance in the English-medium school. Moreover, because the precise
interpretation of the language question in the Census is presently
unknown, it is possible that there are considerably more school-age
LEP children. The 3.4 million is a difference of 2.8 million from the
Census's published estimate of those reported by their households to
speak languages other than English and to lack English-speaking
skills. It is a difference in the proportion of school-age children in
the United States in 1980 of nearly 6 percent. Those reported to speak
languages other than English at home and to lack English-speaking
skills constituted 1.4 percent of the total school-age population in
1980. Those estimated to be LEP constituted a minimum of 7.2 percent
of the total school-age population. The differences are much more
dramatic in states and localities with high concentrations of
language-minority children.
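
The arithmetic behind these figures can be retraced in a few lines. The sketch below is illustrative only. It uses the rounded totals quoted in the text, and the single "implied rate" it computes is an assumption made for illustration: the paper says only that LEP rates from the CESS were applied, and does not state which rates or how they were broken down. The constant and function names are hypothetical.

```python
# Rounded figures quoted in the text (1980 Census, with CESS rates applied).
NON_ENGLISH_AT_HOME = 4_500_000   # school-age children reported to speak non-English languages at home
LEP_NON_ENGLISH_AT_HOME = 2_900_000  # of those, estimated LEP after applying CESS rates
LEP_ENGLISH_AT_HOME = 544_000     # language-minority children speaking only English, estimated LEP
CENSUS_PUBLISHED = 650_000        # reported to speak English "not well" or "not at all"

# Shares of the total school-age population quoted in the text, in percent.
SHARE_CENSUS_PUBLISHED = 1.4
SHARE_ESTIMATED_LEP = 7.2


def retrace_estimates():
    """Recompute the derived figures from the quoted inputs."""
    # Overall rate implied by the two quoted totals (illustrative assumption).
    implied_rate = LEP_NON_ENGLISH_AT_HOME / NON_ENGLISH_AT_HOME
    # Minimum number meeting the Bilingual Education Act definition.
    minimum_lep = LEP_NON_ENGLISH_AT_HOME + LEP_ENGLISH_AT_HOME
    # Gap between that minimum and the Census's published estimate.
    gap = minimum_lep - CENSUS_PUBLISHED
    # Difference in share of the school-age population, in percentage points.
    share_gap = SHARE_ESTIMATED_LEP - SHARE_CENSUS_PUBLISHED
    return implied_rate, minimum_lep, gap, share_gap


if __name__ == "__main__":
    rate, minimum, gap, share_gap = retrace_estimates()
    print(f"implied LEP rate: {rate:.0%}")
    print(f"minimum LEP children: {minimum / 1e6:.1f} million")
    print(f"gap from Census estimate: {gap / 1e6:.1f} million")
    print(f"gap in population share: {share_gap:.1f} percentage points")
```

The sums come to 3,444,000 and 2,794,000, which round to the 3.4 million and 2.8 million given in the text, and the share gap of 5.8 points corresponds to the "nearly 6 percent" difference.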

As stated earlier in this paper, the CESS estimates are the only
ones now available. Similarly, the rates which were used to
extrapolate the estimates of LEP children from the 1980 Census data
are the only ones currently available. The CESS estimates and LEP
rates are national ones. They do not reflect differences related to
concentrations, much less differences which will be revealed when the
data are analyzed by language group. To remedy these problems, and to
obtain LEP data based on an actual sample of 1980 Census respondents,
the Bureau of the Census conducted a special study last summer for the
Department of Education. The test used in the CESS was administered to
children, and an adult test, developed for this purpose, to language
minority adults in a national sample. Results of this study will be
available later this year. They will make possible estimates of LEP
children and adults, based on the 1980 Census, for a number of states
and different language groups. They will make possible new national
estimates. The CESS estimates and the extrapolations from the 1980
Census based on the CESS findings will serve as bench marks for
comparison with the findings of the new study.



The estimates of the size of the population in need will play an
important role in the public perception of the importance of bilingual
education and other programs and services for language minorities.
They will play an important role in continued public support for these
programs and services in the 1980's. The understanding of the
potential benefits of these programs for minority children and for
majority children will greatly affect the extent to which bilingual
education is continued and expanded. If bilingualism is in the
national interest for all the population, then the varying estimates
must be understood for what they are. Narrow concepts of benefit must
not be allowed to obscure the extent of need among minority children.
They must not be allowed to obscure the possibilities for minority and
majority children to interact in mutually rewarding language-learning
experiences.